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Preface 


This document defines the PowerPC User Instruction 
Set Architecture. It covers the base instruction set 
and related facilities available to the application pro- 
grammer. 


Other related documents define the PowerPC Virtual 
Environment Architecture, the PowerPC Operating 
Environment Architecture, and PowerPC Implementa- 
tion Features. Book II, PowerPC Virtual Environment 
Architecture defines the storage model and related 
instructions and facilities available to the application 
programmer, and the time-keeping facilities available 
to the application programmer. Book Ill, PowerPC 


Operating Environment Architecture defines the 
system (privileged) instructions and related facilities. 
Book IV, PowerPC Implementation Features defines 
the implementation-dependent aspects of a particular 
implementation. 


As used in this document, the term “PowerPC Archi- 
tecture” refers to the instructions and facilities 
described in Books |, Il, and Ill. The description of the 
instantiation of the PowerPC Architecture in a given 
implementation includes also the material in Book IV 
for that implementation. 
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1.1 Overview 


This chapter describes computation modes, compat- 
ibility with the POWER Architecture, document con- 
ventions, a processor overview, instruction formats, 
storage addressing, and instruction fetching. 


1.2 Computation Modes 


Processors provide two execution enviroments, 32-bit 
and 64-bit. In both of these environments (modes), 
instructions that set a 64-bit register affect all 64 bits, 
and the value placed into the register is independent 
of mode. 


1.3 Instruction Mnemonics and 
Operands 


The description of each instruction includes the mne- 
monic and a formatted list of operands. Some exam- 
ples are the following. 


stw RS,D(RA) 
addis RT,RA,SI 


PowerPC-compliant Assemblers will support the mne- 
monics and operand lists exactly as shown. They 
should also provide certain extended mnemonics, as 
described in Appendix B, “Assembler Extended 
Mnemonics” on page 148. 
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1.4 Compatibility with the 
POWER Architecture 


The PowerPC Architecture provides binary compat- 
ibility for POWER application programs, except as 
described in Appendix E, “Incompatibilities with the 
POWER Architecture” on page 163. 


Many of the PowerPC instructions are identical to 
POWER instructions. For some of these the PowerPC 
instruction name and/or mnemonic differs from that in 
POWER. To assist readers familiar with the POWER 
Architecture, POWER mnemonics are shown with the 
individual instruction descriptions when they differ 
from the PowerPC mnemonics. Also, Appendix D, 
“Cross-Reference for Changed POWER Mnemonics” 
on page 161 provides a cross-reference from POWER 
mnemonics to PowerPC mnemonics for the 
instructions in Books |, Il, and Ill. 


References to the POWER Architecture include 
POWER2 implementations of the POWER Architecture 
unless otherwise stated. 


1.5 Document Conventions 


1.5.1 Definitions and Notation 


The following definitions and notation are used 
throughout the PowerPC Architecture documents. 


= A program is a sequence of related instructions. 


m Quadwords are 128 bits, doublewords are 64 bits, 
words are 32 bits, halfwords are 16 bits, and 
bytes are 8 bits. 


= All numbers are decimal unless specified in some 
special way. 


— Obnnnn means a number expressed in binary 
format. 

— Oxnnnn means a number expressed in 
hexadecimal format. 


Underscores may be used between digits. 


= RT, RA, Rij, ... refer to General Purpose Regis- 
ters. 


= FRT, FRA, FRi1, ... refer to Floating-Point Regis- 
ters. 


m= (xX) means the contents of register x, where x is 
the name of an instruction field. For example, 
(RA) means the contents of register RA, and 
(FRA) means the contents of register FRA, where 
RA and FRA are instruction fields. Names such 
as LR and CTR denote registers, not fields, so 
parentheses are not used with them. Paren- 
theses are also omitted when register x is the 


register into which the result of an operation is 
placed. 


(RA|O) means the contents of register RA if the 
RA field has the value 1-31, or the value 0 if the 
RA field is 0. 


Bits in registers, instructions, and fields are spec- 
ified as follows. 


— Bits are numbered left to right, starting with 
bit 0. 

— Ranges of bits are specified by two numbers 
separated by a colon (:). The range p:q con- 
sists of bits p through q. 


Xp means bit p of register/field X. 
X 
X 


p:q Means bits p through q of register/field X. 


pq... means bits p, q, ... of register/field X. 


a(RA) means the one's complement of the con- 
tents of register RA. 


Field i refers to bits 4xi through 4xi+3 of a reg- 
ister. 


A period (.) as the last character of an instruction 
mnemonic means that the instruction records 
status information in certain fields of the Condi- 
tion Register as a side effect of execution, as 
described in Chapter 2 through Chapter 4. 


The symbol Il is used to describe the concat- 
enation of two values. For example, 010 Il 111 is 
the same as 010111. 


x" means x raised to the n'” power. 


"x means the replication of x, n times (i.e., x con- 
catenated to itself n—1 times). "0 and ™ are 
special cases: 


— "0 means a field of n bits with each bit equal 
to 0. Thus 50 is equivalent to 0b00000. 

— "1 means a field of n bits with each bit equal 
to 1. Thus 51 is equivalent to 0b11111. 


Positive means greater than zero. 
Negative means less than zero. 


A system library program is a component of the 
system software that can be called by an applica- 
tion program using a Branch instruction. 


A system service program is a component of the 
system software that can be called by an applica- 
tion program using a System Call instruction. 


The system trap handler is a component of the 
system software that receives control when the 
conditions specified in a Trap instruction are sat- 
isfied. 


The system error handler is a component of the 
system software that receives control when an 
error occurs. The system error handler includes 
a component for each of the various kinds of 
error. These error-specific components are 
referred to as the system alignment error 
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handler, the system data storage error handler, 
etc. 


mw Each bit and field in instructions, and in status 
and control registers (e.g., XER, FPSCR) and 
Special Purpose Registers, is either defined or 
reserved. 


a /,//, ///, ... denotes a reserved field in an instruc- 
tion. 


m Latency refers to the interval from the time an 
instruction begins execution until it produces a 
result that is available for use by a subsequent 
instruction. 


m Unavailable refers to a resource that cannot be 
used by the program. For example, storage is 
unavailable if access to it is denied. See Book Ill, 
PowerPC Operating Environment Architecture. 


mu The results of executing a given instruction are 
said to be boundedly undefined if they could have 
been achieved by executing an arbitrary finite 
sequence of instructions (none of which yields 
boundedly undefined results) in the state the 
processor was in before executing the given 
instruction. _Boundedly undefined results may 
include the presentation of inconsistent state to 
the system error handler as described in the 
section entitled “Concurrent Modification and 
Execution of Instructions” in Book Il. Boundedly 
undefined results for a given instruction may vary 
between implementations, and between different 
executions on the same implementation, and are 
not further defined in this document. 


m= The sequential execution model is the model of 
program execution described in Section 2.2, 
“Instruction Execution Order” on page 17. 


1.5.2 Reserved Fields 


Reserved fields in instructions are ignored by the 
processor. 


The handling of reserved bits in System Registers 
(e.g., XER, FPSCR) is implementation-dependent. 
Unless otherwise stated, software is permitted to 
write any value to such a bit. A subsequent reading 
of the bit returns 0 if the value last written to the bit 
was 0 and returns an undefined value (0 or 1) other- 
wise. 





Programming Note 


Reserved fields in instructions should be coded as 
zero, and reserved bits in System Registers 
should be set to zero, because these fields and 
bits may be assigned a meaning in some future 
version of the architecture, such that the value 
zero will be consistent with the “old behavior”. 





Programming Note 


It is the responsibility of software to preserve bits 
that are now reserved in System Registers, as 
they may be assigned a meaning in some future 
version of the architecture. 


In order to accomplish this preservation in imple- 
mentation-independent fashion, software should 
do the following. 


m Initialize each such register supplying zeros 
for all reserved bits. 

a Alter (defined) bit(s) in the register by reading 
the register, altering only the desired bit(s), 
and then writing the new value back to the 
register. 


The XER and FPSCR are partial exceptions to this 
recommendation. Software can alter the status 
bits in these registers, preserving the reserved 
bits, by executing instructions that have the side 
effect of altering the status bits. Similarly, soft- 
ware can alter any defined bit in the FPSCR by 
executing a Floating-Point Status and Control Reg- 
ister instruction. Using such instructions is likely 
to yield better performance than using the method 
described in the second item above. 


When a currently reserved bit is subsequently 
assigned a meaning, every effort will be made to 
have the value to which the system initializes the 
bit correspond to the “old behavior’. 
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1.5.3 Description of Instruction 
Operation 


A formal description is given of the operation of each 
instruction. In addition, the operation of most 
instructions is described by a semiformal language at 
the register transfer level (RTL). This RTL uses the 
notation given below, in addition to the definitions and 
notation described in Section 1.5.1, “Definitions and 
Notation” on page 2. Some of this notation is also 
used in the formal descriptions of instructions. RTL 
notation not summarized here should be _ self- 
explanatory. 


The RTL descriptions cover the normal execution of 
the instruction, except that “standard” setting of the 
Condition Register, Fixed-Point Exception Register, 
and Floating-Point Status and Control Register are not 
shown. (“Non-standard” setting of these registers, 
such as the setting of the Condition Register by the 
Compare instructions, is shown.) The RTL 
descriptions do not cover cases in which the system 
error handler is invoked, or for which the results are 
boundedly undefined. 


The RTL descriptions specify the architectural trans- 
formation performed by the execution of an instruc- 
tion. They do not imply any particular implementation. 


Notation Meaning 
¢ Assignment 
¢€ iea Assignment of an instruction effec- 


tive address. In 32-bit mode the 
high-order 32 bits of the 64-bit target 
address are set to 0. 

A NOT logical operator 


+ Two's complement addition 

- Two's complement subtraction, unary 
minus 

x Multiplication 

+ Division (yielding quotient) 

v Square root 

ee Equals, Not Equals relations 

<,<,>,2 Signed comparison relations 

<, Unsigned comparison relations 

2 Unordered comparison relation 

&, | AND, OR logical operators 

e,= Exclusive OR, Equivalence logical 
operators ((a=b) = (a®-7b)) 

ABS (x) Absolute value of x 

CEIL(x) Least integer > x 

DOUBLE(x) Result of converting x from floating- 
point single format to floating-point 
double format, using the model 
shown on page 97 

EXTS(x) Result of extending x on the left with 
sign bits 

FLOOR(x) Greatest integer < x 

GPR(x) General Purpose Register x 

MASK(x, y) Mask having 1s in positions x 


through y (wrapping if x > y) and Os 
elsewhere 


MEM(x, y) 


ROTL¢a(x, Y) 


ROTL30(X, y) 


SINGLE(x) 


SPREG(x) 
TRAP 
characterization 


undefined 


CIA 


NIA 


if... then ... else 


do 


leave 


Contents of y bytes of storage 
starting at address x. In 32-bit mode 
the high-order 32 bits of the 64-bit 
value x are ignored. 

Result of rotating the 64-bit value x 
left y positions 

Result of rotating the 64-bit value 
xlIx left y positions, where x is 32 
bits long 

Result of converting x from floating- 
point double format to floating-point 
single format, using the model shown 
on page 100 

Special Purpose Register x 

Invoke the system trap handler 
Reference to the setting of status 
bits, in a standard way that is 
explained in the text 

An undefined value. The value may 
vary between implementations, and 
between different executions on the 
same implementation. 

Current Instruction Address, which is 
the 64-bit address of the instruction 
being described by a sequence of 
RTL. Used by relative branches to 
set the Next Instruction Address 
(NIA), and by Branch instructions 
with LK=1 to set the Link Register. 
In 32-bit mode the high-order 32 bits 
of CIA are always set to 0. Does not 
correspond to any architected reg- 
ister. 

Next Instruction Address, which is 
the 64-bit address of the next 
instruction to be executed. For a 
successful branch, the next instruc- 
tion address is the branch target 
address: in RTL, this is indicated by 
assigning a value to NIA. For other 
instructions that cause non- 
sequential instruction fetching (see 
Book Ill, PowerPC Operating Envi- 
ronment Architecture), the RTL is 
similar. For instructions that do not 
branch, and do not otherwise cause 
instruction fetching to be non- 
sequential, the next instruction 
address is CIA+4. In 32-bit mode 
the high-order 32 bits of NIA are 
always set to 0. Does not corre- 
spond to any architected register. 


.. Conditional execution, indenting 


shows range; else is optional. 

Do loop, indenting shows range. 
“To” and/or “by” clauses specify 
incrementing an iteration variable, 
and a “while” clause gives termi- 
nation conditions. 

Leave innermost do loop, or do loop 
described in leave statement. 





4 PowerPC User Instruction Set Architecture 


Version 2.01 





for For loop, indenting shows range. 
Clause after “for” specifies the enti- 
ties for which to execute the body of 
the loop. 


The precedence rules for RTL operators are summa- 
rized in Table 1. Operators higher in the table are 
applied before those lower in the table. Operators at 
the same level in the table associate from left to 
right, from right to left, or not at all, as shown. (For 
example, — associates from left to right, so a~b-—c = 
(a—b)—c.) Parentheses are used to override the eval- 
uation order implied by the table or to increase 
clarity; parenthesized expressions are evaluated 
before serving as operands. 





Table 1. Operator precedence 








Operators 


Associativity 
left to right 
right to left 


subscript, function evaluation 


pre-superscript (replication), 
post-superscript (exponentiation) 





unary —,7 right to left 





xX, left to right 
left to right 
left to right 





left to right 








left to right 
left to right 


none 





¢ none 
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1.6 Processor Overview 


The processor implements the instruction set, the 
storage model, and other facilities defined in this doc- 
ument. Instructions that the processor can execute 
fall into three classes: 


m branch instructions 
m fixed-point instructions 
m floating-point instructions 


Branch instructions are described in Section 2.4, 
“Branch Processor Instructions” on page 20. Fixed- 
point instructions are described in Section 3.3, “Fixed- 
Point Processor Instructions” on page 31. 
Floating-point instructions are described in Section 
4.6, “Floating-Point Processor Instructions” on 
page 97. 


Fixed-point instructions operate on byte, halfword, 
word, and doubleword operands. Floating-point 
instructions operate on single-precision and double- 
precision floating-point operands. The PowerPC 
Architecture uses instructions that are four bytes long 
and word-aligned. It provides for byte, halfword, 
word, and doubleword operand fetches and stores 
between storage and a set of 32 General Purpose 
Registers (GPRs). It also provides for word and 
doubleword operand fetches and stores between 
storage and a set of 32 Floating-Point Registers 
(FPRs). 


Signed integers are represented in two's complement 
form. 


There are no computational instructions that modify 
storage. To use a storage operand in a computation 
and then modify the same or another storage 
location, the contents of the storage operand must be 
loaded into a register, modified, and then stored back 
to the target location. Figure 1 is a logical represen- 
tation of instruction processing. Figure 2 on page 7 
shows the registers of the PowerPC User Instruction 
Set Architecture. 





Branch 
(————_> | Processing 











Fixed-Point and 
Floating-Point 








Instructions 
Fixed-Pt Float-Pt 
Processing Processing 

















Data to/from 
Storage 


Storage 























Instructions 
from Storage 


Figure 1. Logical processing model 
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J 


0 63 


CTR Count Register (page 19) 





XER Fixed-Point Exception Register (page 30) 


oO 
for) 
wo 





| FPSCR | 





Figure 2. PowerPC user register set 


1.7 Instruction Formats 


All instructions are four bytes long and word-aligned. 
Thus, whenever instruction addresses are presented 
to the processor (as in Branch instructions) the low- 
order two bits are ignored. Similarly, whenever the 
processor develops an instruction address the low- 
order two bits are zero. 


Bits 0:5 always specify the opcode (OPCD, below). 
Many instructions also have an extended opcode (XO, 
below). The remaining bits of the instruction contain 
one or more fields as shown below for the different 
instruction formats. 


The format diagrams given below show horizontally 
all valid combinations of instruction fields. The dia- 
grams include instruction fields that are used only by 
instructions defined in Book Il, PowerPC Virtual Envi- 
ronment Architecture, or in Book Ill, PowerPC Oper- 
ating Environment Architecture. 


Condition Register (page 18) 


Link Register (page 19) 


General Purpose Registers (page 29) 


Floating-Point Registers (page 82) 


Floating-Point Status and Control Register (page 83) 





If an instruction is coded such that a field contains a 
value that is not valid for the field (e.g., the values 0, 
1, and 2 are defined for the field but it contains a 
value of 3), the instruction form is invalid (see Section 
1.9.2, “Invalid Instruction Forms” on page 13). 


Split Field Notation 


In some cases an instruction field occupies more than 
one contiguous sequence of bits, or occupies one con- 
tiguous sequence of bits that are used in permuted 
order. Such a field is called a split field. In the 
format diagrams given below and in the individual 
instruction layouts, the name of a split field is shown 
in small letters, once for each of the contiguous 
sequences. In the RTL description of an instruction 
having a split field, and in certain other places where 
individual bits of a split field are identified, the name 
of the field in small letters represents the concat- 
enation of the sequences from left to right. In all 
other places, the name of the field is capitalized and 
represents the concatenation of the sequences in 
some order, which need not be left to right, as 
described for each affected instruction. 
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1.7.1 |-Form 


0 6 30 31 


Figure 3. | instruction format 


= 
A 


1.7.2 B-Form 


0 6 11 16 30 31 
forcp | Bo | et | Bo AA 


Figure 4. B instruction format 


— 
A 


1.7.3 SC-Form 


0 6 11 16 20 27 30 31 


OPCD //I //1 // LEV } // | 1] / 


Figure 5. SC instruction format 


1.7.4 D-Form 





Figure 6. D instruction format 


1.7.5 DS-Form 


Figure 7. DS instruction format 
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1.7.6 X-Form 


wo) 
o 
[ ~ ] 
n 
we) 


ro 
n 
x= 
| ~ | 
x< | X< 
O;O 
PRR Rann i Si ae ne Se nS i eS 
i) i) i) i) i) i=) i=) 


Kx] KI] «| KY] OK] KY] OK] OK] | KI x 
O;/OLO}/O;O};]O};}O;}O0;O0];]0;0 


a 
Di} DD) D| Di DD) Di] D| Div 
NI NDNI NDI NINDI NIN! NIM 


n 
x< 
Oo 


% 
EE 
a aueEE 
=e) ae) 
n 
|r | 


x 
oO 


x|/w}]o|]o 
Mane 
= 
x<| XK | X< 
O|O;0 


AB 
aol, 


~ 
~ 


ic = 


y 
<x] KX | XK] XK] xX] | KI KI] KI xX 
O;O}]O};O;O0;/0}0]; 0] OJ O 


0 

icc PG 
| opop | 
aerens) 
| opcp | 
ecco 
| OPCD | 
eonen 
| OPCD 
[ren] 
| OPCD 
[sozeny) 
| OPCD 
[zene 
| OPCD 
pene 
pore 
[eoneD ae! 
| OPCD | 
[gone ee 
| orcD | F 
[sOReee | 
| pod | F 
Beccos Me 
| oped | 
[ORO sid 
| oreD | 





x 
Oo 


Figure 8. X instruction format 


1.7.7 XL-Form 





Figure 9. XL instruction format 


1.7.8 XFX-Form 


orc [AT 


[90 
Force [ar ff pm pf x0 |r 
Force [As pew [x0 [7 





Figure 10. XFX instruction format 
1.7.9 XFL-Form 


0 67 15 16 31 
[oro] Aim Fe] xO Jrd 


Figure 11. XFL instruction format 
1.7.10 XS-Form 


0 6 11 16 21 30 31 
force [as [ ma] | x0 pe 


Figure 12. XS instruction format 


1.7.11 XO-Form 





Figure 13. XO instruction format 
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1.7.12 A-Form 





Figure 14. A instruction format 


1.7.13 M-Form 


[orc [as [ma | ne | we | ve 
[orco [as [ra | 


Figure 15. M instruction format 
1.7.14 MD-Form 


0 6 11 16 21 2730 31 
OPCD RS XOjshRd 


oreo [ws [ wa [sh [me fob 


Figure 16. MD instruction format 


1.7.15 MDS-Form 


27 
[or] AS] RA] RE] mb [xO]Re 
Torco [as [pa | AB | me [x0] 


Figure 17. MDS instruction format 


1.7.16 Instruction Fields 


AA (30) 


Absolute Address bit. 


0 The immediate field represents an address 
relative to the current instruction address. 
For I-form branches the effective address of 
the branch target is the sum of the LI field 
sign-extended to 64 bits and the address of 
the branch instruction. For B-form branches 
the effective address of the branch target is 
the sum of the BD field sign-extended to 64 
bits and the address of the branch instruc- 
tion. 


1 The immediate field represents an absolute 
address. For |-form branches the effective 
address of the branch target is the LI field 
sign-extended to 64 bits. For B-form 
branches the effective address of the branch 
target is the BD field sign-extended to 64 
bits. 


BA (11:15) 
Field used to specify a bit in the CR to be used as 
a source. 


BB (16:20) 
Field used to specify a bit in the CR to be used as 
a source. 


BD (16:29) 
Immediate field used to specify a 14-bit signed 
two's complement branch displacement which is 
concatenated on the right with Ob00 and sign- 
extended to 64 bits. 


BF (6:8) 
Field used to specify one of the CR fields or one 
of the FPSCR fields to be used as a target. 


BFA (11:13) 
Field used to specify one of the CR fields or one 
of the FPSCR fields to be used as a source. 


BH (19:20) 
Field used to specify a hint in the Branch Condi- 
tional to Link Register and Branch Conditional to 
Count Register instructions. The encoding is 
described in Section 2.4.1, “Branch Instructions” 
on page 20. 


BI (11:15) 
Field used to specify a bit in the CR to be tested 
by a Branch Conditional instruction. 


BO (6:10) 
Field used to specify options for the Branch Con- 
ditional instructions. The encoding is described in 
Section 2.4.1, “Branch Instructions” on page 20. 


BT (6:10) 
Field used to specify a bit in the CR or in the 
FPSCR to be used as a target. 
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D (16:31) 
Immediate field used to specify a 16-bit signed 
two's complement integer which is sign-extended 
to 64 bits. 


DS (16:29) 
Immediate field used to specify a 14-bit signed 
two's complement integer which is concatenated 
on the right with 0b00 and sign-extended to 64 
bits. 


FLM (7:14) 
Field mask used to identify the FPSCR fields that 
are to be updated by the mifsf instruction. 


FRA (11:15) 
Field used to specify an FPR to be used as a 
source. 


FRB (16:20) 
Field used to specify an FPR to be used as a 
source. 


FRC (21:25) 
Field used to specify an FPR to be used as a 
source. 


FRS (6:10) 
Field used to specify an FPR to be used as a 
source. 


FRT (6:10) 
Field used to specify an FPR to be used as a 
target. 


FXM (12:19) 
Field mask used to identify the CR fields that are 
to be written by the mterf and mtocerf instructions, 
or read by the mfoerf instruction. 


L (10 or 15) 
Field used to specify whether a_ fixed-point 
Compare instruction is to compare 64-bit 
numbers or 32-bit numbers. 


Field used by the Move To Machine State Reg- 
ister and TLB Invalidate Entry instructions (see 
Book Ill, PowerPC Operating Environment Archi- 
tecture). 


L (9:10) 
Field used by the Synchronize instruction (see 
Book Il, PowerPC Virtual Environment Architec- 
ture). 

LEV (20:26) 
Field used by the System Call instruction. 

LI (6:29) 
Immediate field used to specify a 24-bit signed 
two's complement integer which is concatenated 


on the right with 0b00 and sign-extended to 64 
bits. 


LK (31) 
LINK bit. 


0 Do not set the Link Register. 


1 Set the Link Register. The address of the 
instruction following the Branch instruction is 
placed into the Link Register. 


MB (21:25) and ME (26:30) 
Fields used in M-form instructions to specify a 
64-bit mask consisting of 1-bits from bit MB+32 
through bit ME+32 inclusive and O-bits else- 
where, as described in Section 3.3.12, “Fixed- 
Point Rotate and Shift Instructions” on page 68. 

MB (21:26) 
Field used in MD-form and MDS-form instructions 
to specify the first 1-bit of a 64-bit mask, as 
described in Section 3.3.12, “Fixed-Point Rotate 
and Shift Instructions” on page 68. 

ME (21:26) 
Field used in MD-form and MDS-form instructions 
to specify the last 1-bit of a 64-bit mask, as 
described in Section 3.3.12, “Fixed-Point Rotate 
and Shift Instructions” on page 68. 

NB (16:20) 
Field used to specify the number of bytes to 
move in an immediate Move Assist instruction. 


OPCD (0:5) 
Primary opcode field. 

OE (21) 
Field used by XO-form instructions to enable 
setting OV and SO in the XER. 

RA (11:15) 
Field used to specify a GPR to be used as a 
source or as a target. 

RB (16:20) 
Field used to specify a GPR to be used as a 
source. 

Re (31) 
RECORD bit. 


0 Do not alter the Condition Register. 
1 Set Condition Register Field 0 or Field 1 as 


described in Section 2.3.1, “Condition 
Register” on page 18. 
RS (6:10) 
Field used to specify a GPR to be used as a 
source. 
RT (6:10) 


Field used to specify a GPR to be used as a 
target. 

SH (16:20, or 16:20 and 30) 
Field used to specify a shift amount. 

SI (16:31) 
Immediate field used to specify a 16-bit signed 
integer. 

SPR (11:20) 
Field used to specify a Special Purpose Register 
for the mtspr and mfspr instructions. 
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SR (12:15) 
Field used by the Segment Register Manipulation 
instructions (see Book Ill, PowerPC Operating 
Environment Architecture). 

TBR (11:20) 


Field used by the Move From Time Base instruc- 
tion (see Book II, PowerPC Virtual Environment 
Architecture). 


TH (9:10) 
Field used by the optional data stream variant of 
the debt instruction (see Book II, PowerPC Virtual 
Environment Architecture). 


TO (6:10) 
Field used to specify the conditions on which to 
trap. The encoding is described in Section 3.3.10, 
“Fixed-Point Trap Instructions” on page 60. 


U (16:19) 
Immediate field used as the data to be placed 
into a field in the FPSCR. 


UI (16:31) 
Immediate field used to specify a 16-bit unsigned 
integer. 


XO (21:29, 21:30, 22:30, 26:30, 27:29, 27:30, or 30:31) 
Extended opcode field. 


1.8 Classes of Instructions 


An instruction falls into exactly one of the following 
three classes: 


Defined 
Illegal 
Reserved 


The class is determined by examining the opcode, and 
the extended opcode if any. If the opcode, or combi- 
nation of opcode and extended opcode, is not that of 
a defined instruction or of a reserved instruction, the 
instruction is illegal. 


A given instruction is in the same class for all imple- 
mentations of the PowerPC Architecture. In future 
versions of this architecture, instructions that are now 
illegal may become defined (by being added to the 
architecture) or reserved (by being assigned to one of 
the special purposes described in Appendix H, 
“Reserved Instructions” on page 177). Similarly, 


instructions that are now reserved may become 
defined. 


1.8.1 Defined Instruction Class 


This class of instructions contains all the instructions 
defined in the PowerPC User Instruction Set Architec- 
ture, PowerPC Virtual Environment Architecture, and 
PowerPC Operating Environment Architecture. 


In general, defined instructions are guaranteed to be 
provided in all implementations. The only exceptions 
are instructions that are optional instructions. These 
exceptions are _ identified in the instruction 
descriptions. 


A defined instruction can have preferred and/or 
invalid forms, as described in Section 1.9.1, “Pre- 
ferred Instruction Forms” on page 13 and Section 
1.9.2, “Invalid Instruction Forms” on page 13. 


1.8.2 Illegal Instruction Class 


This class of instructions contains the set of 
instructions described in Appendix G, “Illegal 
Instructions” on page 175. Illegal instructions are 
available for future extensions of the PowerPC Archi- 
tecture; that is, some future version of the PowerPC 
Architecture may define any of these instructions to 
perform new functions. 


Any attempt to execute an illegal instruction will 
cause the system illegal instruction error handler to 
be invoked and will have no other effect. 


An instruction consisting entirely of binary Os is guar- 
anteed always to be an illegal instruction. This 
increases the probability that an attempt to execute 
data or uninitialized storage will result in the invoca- 
tion of the system illegal instruction error handler. 


1.8.3 Reserved Instruction Class 


This class of instructions contains the set of 
instructions described in Appendix H, “Reserved 
Instructions” on page 177. 


Reserved instructions are allocated to specific pur- 
poses that are outside the scope of the PowerPC 
Architecture. 
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Any attempt to execute a reserved instruction will: 


m perform the actions described in Book IV, 
PowerPC Implementation Features for the imple- 
mentation if the instruction is implemented; or 

m cause the system illegal instruction error handler 
to be invoked if the instruction is not imple- 
mented. 


1.9 Forms of Defined Instructions 


1.9.1 Preferred Instruction Forms 


Some of the defined instructions have preferred 
forms. For such an instruction, the preferred form will 
execute in an efficient manner, but any other form 
may take significantly longer to execute than the pre- 
ferred form. 


Instructions having preferred forms are: 


the Condition Register Logical instructions 

the Load Quadword instruction 

the Load/Store Multiple instructions 

the Load/Store String instructions 

the Or Immediate instruction (preferred form of 
no-op) 

mu the Move To Condition Register Fields instruction 


1.9.2 Invalid Instruction Forms 


Some of the defined instructions can be coded in a 
form that is invalid. An instruction form is invalid if 
one or more fields of the instruction, excluding the 
opcode fields and reserved fields, are coded incor- 
rectly in a manner that can be deduced by examining 
only the instruction encoding. 


Any attempt to execute an invalid form of an instruc- 
tion will either cause the system illegal instruction 
error handler to be invoked or yield boundedly unde- 
fined results. Exceptions to this rule are stated in the 
instruction descriptions. 


If a field (excluding opcode fields) of one of the 
instructions identified below contains a value other 
than that specified in the layout diagram, the instruc- 
tion form is invalid. 


mu the Store Conditional instructions (see Book Il, 
PowerPC Virtual Environment Architecture) 


These invalid forms are not discussed further. The 
invalid forms of the instructions in the following list 
are identified in the instruction descriptions. 


= the Branch Conditional to Count Register 
(bectr[ !]) instruction 

a the Load/Store with Update instructions 

m the Load Multiple instruction 

a the Load String instructions 


m the’ Load/Store 
instructions 


Floating-Point with 


Update 


Assembler Note 





Assemblers should report uses of invalid instruc- 
tion forms as errors. 





1.10 Optionality 


Some of the defined instructions are optional. The 
optional instructions are defined in Chapter 5, 
“Optional Facilities and Instructions” on page 117. 
Additional optional instructions may be defined in 
Books II and Ill (e.g., see the section entitled “Look- 
aside Buffer Management” in Book III, and the chap- 
ters entitled “Optional Facilities and Instructions” in 
Book II and Book Ill). 


Any attempt to execute an optional instruction that is 
not provided by the implementation will cause the 
system illegal instruction error handler to be invoked. 


In addition to instructions, other kinds of optional 
facilities, such as registers, may be defined in Books 
ll and Ill. The effects of attempting to use an optional 
facility that is not provided by the implementation are 
described in Books II and III as appropriate. 


1.11 Exceptions 


There are two kinds of exception, those caused 
directly by the execution of an instruction and those 
caused by an asynchronous event. In either case, the 
exception may cause one of several components of 
the system software to be invoked. 


The exceptions that can be caused directly by the 
execution of an instruction include the following: 


= an attempt to execute an illegal instruction, or an 
attempt by an application program to execute a 
“privileged” instruction (see Book III, PowerPC 
Operating Environment Architecture) (system 
illegal instruction error handler or system privi- 
leged instruction error handler) 


m the execution of a defined instruction using an 
invalid form (system illegal instruction error 
handler or system privileged instruction error 
handler) 


m the execution of an optional instruction that is not 
provided by the implementation (system illegal 
instruction error handler) 


m= an attempt to access a storage location that is 
unavailable (system instruction storage error 
handler or system data storage error handler) 
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= an attempt to access storage with an effective 
address alignment that is invalid for the instruc- 
tion (system alignment error handler) 


m the execution of a System Call instruction 
(system service program) 


m the execution of a Trap instruction that traps 
(system trap handler) 


m the execution of a floating-point instruction that 
causes a floating-point enabled exception to exist 
(system floating-point enabled exception error 
handler) 


The exceptions that can be caused by an asynchro- 
nous event are described in Book Ill, PowerPC Oper- 
ating Environment Architecture. 


The invocation of the system error handler is precise, 
except that if one of the imprecise modes for invoking 
the system floating-point enabled exception error 
handler is in effect (see page 90) then the invocation 
of the system floating-point enabled exception error 
handler may be imprecise. When the system error 
handler is invoked imprecisely, the excepting instruc- 
tion does not appear to complete before the next 
instruction starts (because one of the effects of the 
excepting instruction, namely the invocation of the 
system error handler, has not yet occurred). 


Additional information about exception handling can 
be found in Book III, PowerPC Operating Environment 
Architecture. 


1.12 Storage Addressing 


A program references storage using the effective 
address computed by the processor when it executes 
a Storage Access or Branch instruction (or certain 
other instructions described in Book Il, PowerPC 
Virtual Environment Architecture, and Book _ Ill, 
PowerPC Operating Environment Architecture), or 
when it fetches the next sequential instruction. 


1.12.1 Storage Operands 


Bytes in storage are numbered consecutively starting 
with 0. Each number is the address of the corre- 
sponding byte. 


Storage operands may be bytes, halfwords, words, or 
doublewords, or, for the Load/Store Multiple and 
Move Assist instructions, a sequence of bytes or 
words. The address of a storage operand is the 
address of its first byte (i.e., of its lowest-numbered 
byte). Byte ordering is Big-Endian. However, if the 
optional Little-Endian facility is implemented the 
system can be operated in a mode in which byte 
ordering is Little-Endian; see Section 5.3. 





Operand length is implicit for each instruction. 


The operand of a single-register Storage Access 
instruction has a “natural” alignment boundary equal 
to the operand length. In other words, the “natural” 
address of an operand is an integral multiple of the 
operand length. A storage operand is said to be 
aligned if it is aligned at its natural boundary; other- 
wise it is said to be unaligned. 


Storage operands for single-register Storage Access 
instructions have the following characteristics. 
(Although not permitted as storage operands, 
quadwords are shown because quadword alignment is 
desirable for certain storage operands.) 

















Operand Length Addrgo.¢3 if aligned 
Byte 8 bits XXXX 

Halfword 2 bytes Xxx 

Word 4 bytes xx00 

Doubleword 8 bytes x000 

Quadword 16 bytes 0000 

Note: An “x” in an address bit position indicates 
that the bit can be 0 or 1 independent of the state of 
other bits in the address. 


The concept of alignment is also applied more gener- 
ally, to any datum in storage. For example, a 12-byte 
datum in storage is said to be word-aligned if its 
address is an integral multiple of 4. 


Some instructions require their storage operands to 
have certain alignments. In addition, alignment may 
affect performance. For single-register Storage 
Access instructions the best performance is obtained 
when storage operands are aligned. Additional 
effects of data placement on performance are 
described in Book Il, PowerPC Virtual Environment 
Architecture. 


Instructions are always four bytes long and word- 
aligned. 


1.12.2 Effective Address Calculation 


An effective address is computed by the processor 
when executing a Storage Access or Branch instruc- 
tion (or certain other instructions described in Book Il, 
PowerPC Virtual Environment Architecture, and Book 
Ill, PowerPC Operating Environment Architecture) or 
when fetching the next sequential instruction. The fol- 
lowing provides an overview of this process. More 
detail is provided in the individual instruction 
descriptions. 


Effective address calculations, for both data and 
instruction accesses, use 64-bit two's complement 
addition. All 64 bits of each address component par- 
ticipate in the calculation regardless of mode (32-bit 
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or 64-bit). In this computation one operand is an 
address (which is by definition an unsigned number) 
and the second is a signed offset. Carries out of the 
most significant bit are ignored. 


In 64-bit mode, the entire 64-bit result comprises the 
64-bit effective address. The effective address arith- 
metic wraps around from the maximum address, 
2% _ 1, to address 0. 


In 32-bit mode, the low-order 32 bits of the 64-bit 
result comprise the effective address for the purpose 
of addressing storage. The high-order 32 bits of the 
64-bit effective address are ignored for the purpose of 
accessing data, but are included whenever an effec- 
tive address is placed into a GPR by Load with Update 
and Store with Update instructions. The high-order 32 
bits of the 64-bit effective address are effectively set 
to 0 for the purpose of fetching instructions, and 
explicitly so whenever an effective address is placed 
into the Link Register by Branch instructions having 
LK=1. The high-order 32 bits of the 64-bit effective 
address are set to 0 in Special Purpose Registers 
when the system error handler is invoked. As used to 
address storage, the effective address arithmetic 
appears to wrap around from the maximum address, 
2° 1, to address 0 in 32-bit mode. 


The 64-bit current instruction address and next 
instruction address are not affected by a change from 
32-bit mode to 64-bit mode, but they are affected by a 
change from 64-bit mode to 32-bit mode. In the latter 
case, the high-order 32 bits are set to 0. 


RA is a field in the instruction which specifies an 
address component in the computation of an effective 
address. A zero in the RA field indicates the absence 
of the corresponding address component. A value of 
zero is substituted for the absent component of the 
effective address address computation. This substi- 
tution is shown in the instruction descriptions as 
(RA|O). 


Effective addresses are computed as follows. In the 
descriptions below, it should be understood that “the 
contents of a GPR” refers to the entire 64-bit con- 
tents, independent of mode, but that in 32-bit mode 
only bits 32:63 of the 64-bit result of the computation 
are used to address storage. 


a With X-form instructions, in computing the effec- 
tive address of a data element, the contents of 


the GPR designated by RB (or the value zero for 
Iswi, Isdi, stswi, and stsdi) are added to the con- 
tents of the GPR designated by RA or to zero if 
RA=0. 


a With D-form instructions, the 16-bit D field is sign- 
extended to form a 64-bit address component. In 
computing the effective address of a data 
element, this address component is added to the 
contents of the GPR designated by RA or to zero 
if RA=0. 


a With DS-form instructions, the 14-bit DS field is 
concatenated on the right with O0b00 and sign- 
extended to form a 64-bit address component. In 
computing the effective address of a data 
element, this address component is added to the 
contents of the GPR designated by RA or to zero 
if RA=0. 


a With l-form Branch instructions, the 24-bit LI field 
is concatenated on the right with 0b00 and sign- 
extended to form a 64-bit address component. If 
AA=0, this address component is added to the 
address of the Branch instruction to form the 
effective address of the next instruction. _ If 
AA=1, this address component is the effective 
address of the next instruction. 


a With B-form Branch instructions, the 14-bit BD 
field is concatenated on the right with 0b00 and 
sign-extended to form a 64-bit address compo- 
nent. If AA=0, this address component is added 
to the address of the Branch instruction to form 
the effective address of the next instruction. If 
AA=1, this address component is the effective 
address of the next instruction. 


a With XL-form Branch instructions, bits 0:61 of the 
Link Register or the Count Register are concat- 
enated on the right with 0b00 to form the effec- 
tive address of the next instruction. 


a With sequential instruction fetching, the value 4 is 
added to the address of the current instruction to 
form the effective address of the next instruction. 


= For an exception to sequential addressing when a 
change from 32- to 64-bit mode occurs, see the 
section entitled “Address Wrapping Combined 
with Changing MSR Bit SF” in Book III. 


If the size of the operand of a storage access instruc- 
tion is more than one byte, the effective address for 
each byte after the first is computed by adding 1 to 
the effective address of the preceding byte. 
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Chapter 2. Branch Processor 


2.1 Branch Processor Overview .... 17 
2.2 Instruction Execution Order .... 17 
2.3 Branch Processor Registers .... 18 
2.3.1 Condition Register ......... 18 
2.3.2 Link Register ............ 19 
2.3.3 Count Register ........... 19 
2.4 Branch Processor Instructions ... 20 


2.1 Branch Processor Overview 


This chapter describes the registers and instructions 
that make up the Branch Processor facility. Section 
2.3, “Branch Processor Registers” on page 18 
describes the registers associated with the Branch 
Processor. Section 2.4, “Branch Processor 
Instructions” on page 20 describes the instructions 
associated with the Branch Processor. 


2.2 Instruction Execution Order 


In general, instructions appear to execute sequen- 
tially, in the order in which they appear in storage. 
The exceptions to this rule are listed below. 


= Branch instructions for which the branch is taken 
cause execution to continue at the target address 
specified by the Branch instruction. 


m Trap instructions for which the trap conditions are 
satisfied, and System Call instructions, cause the 
appropriate system handler to be invoked. 


m= Exceptions can cause the system error handler to 
be invoked, as described in Section 1.11, 
“Exceptions” on page 13. 


= Returning from a system service program, 
system trap handler, or system error handler 
causes execution to continue at a_ specified 
address. 


2.4.1 Branch Instructions ........ 20 
2.4.2 System Call Instruction ...... 25 
2.4.3 Condition Register Logical 
Instructions ................ 26 
2.4.4 Condition Register Field 
INStUCHION? ac 25.3 Se eae ee SG 28 


The model of program execution in which the 
processor appears to execute one instruction at a 
time, completing each instruction before beginning to 
execute the next instruction is called the “sequential 
execution model”. In general, the processor obeys 
the sequential execution model. For the instructions 
and facilities defined in this Book, the only exceptions 
to this rule are the following. 


= A_ floating-point exception occurs when the 
processor is running in one of the Imprecise float- 
ing-point exception modes (see Section 4.4, 
“Floating-Point Exceptions” on page 89). The 
instruction that causes the exception does not 
complete before the next instruction begins exe- 
cution, with respect to setting exception bits and 
(if the exception is enabled) invoking the system 
error handler. 


= A Store instruction modifies one or more bytes in 
an area of storage that contains instructions that 
will subsequently be executed. Before an instruc- 
tion in that area of storage is executed, software 
synchronization is required to ensure that the 
instructions executed are consistent with the 
results produced by the Store instruction. 


Programming Note 


This software synchronization will generally 
be provided by system library programs (see 
the section entitled “Instruction Storage” in 
Book Il). Application programs should call the 
appropriate system library program before 
attempting to execute modified instructions. 
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2.3 Branch Processor Registers 


2.3.1 Condition Register 


The Condition Register (CR) is a 32-bit register which 
reflects the result of certain operations, and provides 
a mechanism for testing (and branching). 


0 31 


Figure 18. Condition Register 


The bits in the Condition Register are grouped into 
eight 4-bit fields, named CR Field 0 (CRO), ..., CR Field 
7 (CR7), which are set in one of the following ways. 


= Specified fields of the CR can be set by a move 
to the CR from a GPR (mterf, mtocrf). 

= A specified field of the CR can be set by a move 
to the CR from another CR field (merf), from 
XER39.35 (merxr), or from the FPSCR (merfs). 

= CR Field 0 can be set as the implicit result of a 
fixed-point instruction. 

m= CR Field 1 can be set as the implicit result of a 
floating-point instruction. 

= A specified CR field can be set as the result of 
either a fixed-point or a floating-point Compare 
instruction. 


Instructions are provided to perform logical oper- 
ations on individual CR bits and to test individual CR 
bits. 


For all fixed-point instructions in which Rc=1, and for 
addic., andi., and andis., the first three bits of CR 
Field 0 (bits 0:2 of the Condition Register) are set by 
signed comparison of the result to zero, and the 
fourth bit of CR Field 0 (bit 3 of the Condition Reg- 
ister) is copied from the SO field of the XER. “Result” 
here refers to the entire 64-bit value placed into the 
target register in 64-bit mode, and to bits 32:63 of the 
64-bit value placed into the target register in 32-bit 
mode. 


if (64-bit mode) 

then M ¢ 0 

else M € 32 
if (target_register) y.e3 < 0 then c € 0b100 
else if (target_register) weg > 0 then c € 0b010 
else c € 0b001 
CRO €¢ Il XERgo 


If any portion of the result is undefined, then the 
value placed into the first three bits of CR Field 0 is 
undefined. 

The bits of CR Field 0 are interpreted as follows. 

Bit Description 


0 Negative (LT) 
The result is negative. 


1 Positive (GT) 
The result is positive. 


2 = Zero (EQ) 
The result is zero. 


3 Summary Overflow (SO) 
This is a copy of the final state of XERgo at the 
completion of the instruction. 





Programming Note 


CR Field 0 may not reflect the “true” (infinitely 
precise) result if overflow occurs; see Section 
3.3.8, “Fixed-Point Arithmetic Instructions” on 
page 49. 





The stwex. and stdcex. instructions (see Book Il, 
PowerPC Virtual Environment Architecture) also set 
CR Field 0. 


For all floating-point instructions in which Rc=1, CR 
Field 1 (bits 4:7 of the Condition Register) is set to the 
Floating-Point exception status, copied from bits 0:3 of 
the Floating-Point Status and Control Register. These 
bits are interpreted as follows. 


Bit Description 


4 Floating-Point Exception Summary (FX) 
This is a copy of the final state of FPSCRry at the 
completion of the instruction. 


5 Floating-Point Enabled Exception Summary (FEX) 
This is a copy of the final state of FPSCRer¢y at 
the completion of the instruction. 


6 Floating-Point 
Summary (VX) 
This is a copy of the final state of FPSCRy, at the 
completion of the instruction. 


Invalid Operation Exception 


7 Floating-Point Overflow Exception (OX) 
This is a copy of the final state of FPSCRo, at 
the completion of the instruction. 


For Compare instructions, a specified CR field is set 
to reflect the result of the comparison. The bits of the 
specified CR field are interpreted as follows. A com- 
plete description of how the bits are set is given in 
the instruction descriptions in Section 3.3.9, “Fixed- 
Point Compare Instructions” on page 58 and Section 
4.6.7, “Floating-Point Compare Instructions” on 
page 113. 


Bit Description 


0 Less Than, Floating-Point Less Than (LT, FL) 
For fixed-point Compare instructions, (RA) < Sl 
or (RB) (signed comparison) or (RA) < UI or (RB) 
(unsigned comparison). For _ floating-point 
Compare instructions, (FRA) < (FRB). 
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1 Greater Than, Floating-Point Greater Than (GT, 
FG) 
For fixed-point Compare instructions, (RA) > Sl 
or (RB) (signed comparison) or (RA) > UI or (RB) 
(unsigned comparison). For _ floating-point 
Compare instructions, (FRA) > (FRB). 


2 Equal, Floating-Point Equal (EQ, FE) 
For fixed-point Compare instructions, (RA) = Sl, 
Ul, or (RB). For floating-point Compare 
instructions, (FRA) = (FRB). 


3 Summary Overflow, Floating-Point Unordered 
(SO, FU) 
For fixed-point Compare instructions, this is a 
copy of the final state of XERgo at the completion 
of the instruction. For floating-point Compare 
instructions, one or both of (FRA) and (FRB) is a 
NaN. 


2.3.2 Link Register 


The Link Register (LR) is a 64-bit register. It can be 
used to provide the branch target address for the 
Branch Conditional to Link Register instruction, and it 
holds the return address after Branch instructions for 
which LK=1. 


| LR 
0 63 


Figure 19. Link Register 


2.3.3 Count Register 


The Count Register (CTR) is a 64-bit register. It can 
be used to hold a loop count that can be decremented 
during execution of Branch instructions that contain 
an appropriately coded BO field. If the value in the 
Count Register is 0 before being decremented, it is 
-—1 afterward. The Count Register can also be used 
to provide the branch target address for the Branch 
Conditional to Count Register instruction. 


CTR 
0 63 


Figure 20. Count Register 
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2.4 Branch Processor Instructions 


2.4.1 Branch Instructions 


The sequence of instruction execution can be changed 
by the Branch instructions. Because all instructions 
are on word boundaries, bits 62 and 63 of the gener- 
ated branch target address are ignored by the 
processor in performing the branch. 


The Branch instructions compute the effective 
address (EA) of the target in one of the following four 
ways, as described in Section 1.12.2, “Effective 
Address Calculation” on page 14. 


1. Adding a displacement to the address of the 
Branch instruction (Branch or Branch Conditional 
with AA=0). 


2. Specifying an absolute address (Branch or 
Branch Conditional with AA=1). 


3. Using the address contained in the Link Register 
(Branch Conditional to Link Register). 


4. Using the address contained in the Count Reg- 
ister (Branch Conditional to Count Register). 


In all four cases, in 32-bit mode the final step in the 
address computation is setting the high-order 32 bits 
of the target address to 0. 


For the first two methods, the target addresses can 
be computed sufficiently ahead of the Branch instruc- 
tion that instructions can be prefetched along the 
target path. For the third and fourth methods, pre- 
fetching instructions along the target path is also pos- 
sible provided the Link Register or the Count Register 
is loaded sufficiently ahead of the Branch instruction. 


Branching can be conditional or unconditional, and 
the return address can optionally be provided. If the 
return address is to be provided (LK=1), the effective 
address of the instruction following the Branch 
instruction is placed into the Link Register after the 
branch target address has been computed; this is 
done regardless of whether the branch is taken. 


For Branch Conditional instructions, the BO field 
specifies the conditions under which the branch is 
taken, as shown in Figure 21. In the figure, M=0 in 
64-bit mode and M=32 in 32-bit mode. If the BO field 
specifies that the CTR is to be decremented, the 
entire 64-bit CTR is decremented regardless of the 
mode. 


Decrement the CTR, then branch if the 
decremented CTRiy.g3# 0 and CRg)= 0 
Decrement the CTR, then branch if the 


decremented CTRy.g3= 0 and CR»g)= 0 
Branch if CRg)=0 


Decrement the CTR, then branch if the 
decremented CTRy.g3# 0 and CRa)= 1 


Decrement the CTR, then branch if the 
decremented CTRy-g3= 0 and CRa)= 1 


Branch if CRp)= 1 


Decrement the CTR, then branch if the 
decremented CTRiy.¢3# 0 


Decrement the CTR, a branch if the 
decremented CTRiy.¢3= 0 


Branch always 


ee 
.“z” denotes a bit that is ignored. 
: The “a” and “t” bits are used as described 
below. 





Figure 21. BO field encodings 


The “a” and “t” bits of the BO field can be used by 
software to olide a hint about whether the branch 
is likely to be taken or is likely not to be taken, as 
shown in Figure 22. 


at | Hint 


[oo [No vinisahen 


Figure 22. 





"at" bit encodings 





Programming Note 





Many implementations have dynamic mechanisms 
for predicting whether a branch will be taken. 
Because the dynamic prediction is likely to be 
very accurate, and is likely to be overridden by 
any hint provided by the “at” bits, the “at” bits 
should be set to 0b00 unless the static prediction 
implied by at=0b10 or at=0b11 is highly likely to 
be correct. 
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For Branch Conditional to Link Register and Branch 
Conditional to Count Register instructions, the BH 
field provides a hint about the use of the instruction, 
as shown in Figure 23. 


The instruction is a subroutine return 

becir[ I: The instruction is not a subroutine 
return; the target address is likely to 
be the same as the target address 
used the preceding time the branch 
was taken 


The instruction is not a subroutine 
return; the target address is likely to 
be the same as the target address 
used the preceding time the branch 
was taken 

becir[ |: Reserved 


11 | belr{l and bectr[!: The target address is not 
predictable 


Figure 23. BH field encodings 








Programming Note 


The hint provided by the BH field is independent 
of the hint provided by the “at” bits (e.g., the BH 
field provides no indication of whether the branch 
is likely to be taken). 





Extended mnemonics for branches 


Many extended mnemonics are provided so that 
Branch Conditional instructions can be coded with 
portions of the BO and BI fields as part of the mne- 
monic rather than as part of a numeric operand. 
Some of these are shown as examples with the 
Branch instructions. See Appendix B, “Assembler 
Extended Mnemonics” on page 143 for additional 
extended mnemonics. 








Programming Note 


The hints provided by the “at” bits and by the BH 
field do not affect the results of executing the 
instruction. 


The “z” bits should be set to 0, because they may 
be assigned a meaning in some future version of 
the architecture. 








Programming Note 


Many implementations have dynamic mechanisms 
for predicting the target addresses of belr[f] and 
becir[!| instructions. These mechanisms may 
cache return addresses (i.e., Link Register values 
set by Branch instructions for which LK=1 and for 
which the branch was taken) and recently used 
branch target addresses. To obtain the best per- 
formance across the widest range of implementa- 
tions, the programmer should obey the following 
rules. 


m= Use Branch instructions for which LK=1 only 
as subroutine calls (including function calls, 
etc.). 

m Pair each subroutine call (i.e., each Branch 
instruction for which LK=1 and the branch is 
taken) with a belr instruction that returns from 
the subroutine and has BH=0b00. 

= Do not use belrl as a subroutine call. (Some 
implementations access the return address 
cache at most once per instruction; such 
implementations are likely to treat belrl as a 
subroutine return, and not as a subroutine 
call.) 

= For belr{f and bectr[!, use the appropriate 
value in the BH field. 


The following are examples of programming con- 
ventions that obey these rules. In the examples, 
BH is assumed to contain 0b00 unless otherwise 
stated. In addition, the “at” bits are assumed to 
be coded appropriately. 


Let A, B, and Glue be specific programs. 


= Loop counts: 
Keep them in the Count Register, and use a 
be instruction (LK=0) to decrement the count 
and to branch back to the beginning of the 
loop if the decremented count is nonzero. 


= Computed goto's, case statements, etc.: 
Use the Count Register to hold the address to 
branch to, and use a bectr instruction (LK=0, 
and BH=0b11 if appropriate) to branch to the 
selected address. 


m= Direct subroutine linkage: 


Here A calls B and B returns to A. The two 
branches should be as follows. 


— A calls B: use a bl or bel instruction 
(LK=1). 

— B returns to A: use a belr instruction 
(LK=0) (the return address is in, or can 
be restored to, the Link Register). 


m Indirect subroutine linkage: 

Here A calls Glue, Glue calls B, and B returns 
to A rather than to Glue. (Such a calling 
sequence is common in linkage code used 
when the subroutine that the programmer 
wants to call, here B, is in a different module 
from the caller; the Binder inserts “glue” 
code to mediate the branch.) The three 
branches should be as follows. 


— A calls Glue: use a bi or bel instruction 
(LK=1). 


(Programming Note continues in next column....) 
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Programming Note (continued) 


— Glue calls B: place the address of B into 
the Count Register, and use a_ bectr 
instruction (LK=0). 

— B returns to A: use a belr instruction 
(LK=0) (the return address is in, or can 
be restored to, the Link Register). 


m= Function call: 

Here A calls a function, the identity of which 
may vary from one instance of the call to 
another, instead of calling a specific program 
B. This case should be handled using the 
conventions of the preceding two bullets, 
depending on whether the call is direct or 
indirect, with the following differences. 


— If the call is direct, place the address of 
the function into the Count Register, and 
use a bectrl instruction (LK=1) instead of 
a bl or bel instruction. 

— For the beetr[/| instruction that branches 
to the function, use BH=0b11 if appro- 
priate. 








Compatibility Note 
The bits corresponding to the current “a” and “t” 


bits, and to the current “z” bits except in the 
“branch always” BO encoding, had different 
meanings in versions of the architecture that 


precede Version 2.00. 


m The bit corresponding to the “t” bit was called 
the “y” bit. The “y” bit indicated whether to 
use the architected default prediction (y=0) 
or to use the complement of the default pre- 
diction (y=1). The default prediction was 
defined as follows. 


— If the instruction is be[ [a] with a nega- 
tive value in the displacement field, the 
branch is taken. (This is the only case in 
which the prediction corresponding to the 
“y” bit differs from the prediction corre- 
sponding to the “t” bit.) 


— In all other cases (be[f|[a] with a non- 
negative value in the displacement field, 
belr{f], or bectr{l), the branch is not 
taken. 


= The BO encodings that test both the Count 
Register and the Condition Register had a “y” 
bit in place of the current “z” bit. The 
meaning of the “y” bit was as described in 
the preceding item. 


a The “a” bit was a “z” bit. 


Because these bits have always been defined 
either to be ignored or to be treated as hints, a 
given program will produce the same result on 
any implementation regardless of the values of 
the bits. Also, because even the “y” bit is 
ignored, in practice, by most processors that 
implement versions of the architecture that 
precede Version 2.00, the performance of a given 
program on those processors will not be affected 
by the values of the bits. 
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Branch I-form 


Branch Conditional B-form 


b target_addr (AA=0 LK=0) 
ba target_addr (AA=1 LK=0) 
bl target_addr (AA=0 LK=1) 
bla target_addr (AA=1 LK=1) 

18 LI AA|LK 
0 6 30 | 31 


if AA then NIA €., EXTS(LI Il 0b00) 
else NIA €,., CIA + EXTS(LI II 0b00) 
if LK then LR €,, CIA + 4 


target_adadr specifies the branch target address. 


lf AA=0 then the branch target address is the sum of 
LI || Ob00 sign-extended and the address of this 
instruction, with the high-order 32 bits of the branch 
target address set to 0 in 32-bit mode. 


If AA=1 then the branch target address is the value 
LI Il Ob00 sign-extended, with the high-order 32 bits of 
the branch target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
LR (if LK=1) 


bc BO,Bl,target_addr (AA=0 LK=0) 
bca BO,Bl,target_addr (AA=1 LK=0) 
bel BO,Bl,target_addr (AA=0 LK=1) 
bcla BO,Bl,target_addr (AA=1 LK=1) 


16 BO Bl BD AA\LK 
0 6 1 16 30 | 31 


if (64-bit mode) 

then M € 0 

else M € 32 
if -BO> then CTR ¢ CTR - 1 
ctr_ok € BOs | ((CTRy.63 # 0) ® BOs) 
cond_ok ¢ BOg | (CRg; = B04) 
if ctr_ok & cond_ok then 

if AA then NIA €,., EXTS(BD Il 0b00) 

else NIA €., CIA + EXTS(BD II 0b00) 
if LK then LR 4., CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. target_addr specifies the 
branch target address. 


If AA=0 then the branch target address is the sum of 
BD || Ob00 sign-extended and the address of this 
instruction, with the high-order 32 bits of the branch 
target address set to 0 in 32-bit mode. 


If AA=1 then the branch target address is the value 
BD || 0b00 sign-extended, with the high-order 32 bits 
of the branch target address set to 0 in 32-bit mode. 


lf LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
CTR (if BOo=0) 
LR (if LK=1) 


Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional: 


Extended: Equivalent to: 

blt target bc 12,0,target 
bne cr2,target bc 4,10,target 
bdnz target bc 16,0,target 
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Branch Conditional to Link Register 
XL-form 


belr BO,BI,BH (LK=0) 
belrl BO,BI,BH (LK=1) 


[POWER mnemonics: ber, berl] 


19 BO BI //! |BH 16 LK 
0 6 11 16 |19 | 21 31 


if (64-bit mode) 
then M ¢ 0 
else M € 32 
if -BO> then CTR ¢ CTR - 1 
ctr_ok € BOs | ((CTRy.63 # 0) ® BOs) 
cond_ok ¢ BOg | (CRp; = B04) 
if ctr_ok & cond_ok then NIA €,.2 LRo.g, Il 0b00 
if LK then LR 4., CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. The BH field is used as 
described in Figure 23. The branch target address is 
LRo-g; |! Ob00, with the high-order 32 bits of the 
branch target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
CTR (if BOo=0) 
LR (if LK=1) 


Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional to Link Register: 


Extended: Equivalent to: 
belr 46 bclr  4,6,0 

bltlr belr 12,0,0 
bnelr cr2 belr 4,10,0 
bdnzlr belr 16,0,0 


Programming Note 


belr, belrl, bectr, and bectrl each serve as both a 
basic and an extended mnemonic. The Assembler 
will recognize a belr, belrl, bectr, or bectrl mne- 
monic with three operands as the basic form, and 
a belr, belrl, bectr, or bectrl mnemonic with two 
operands as the extended form. In the extended 
form the BH operand is omitted and assumed to 
be 0b00. 








Branch Conditional to Count Register 
XL-form 


bectr BO,BI,BH (LK=0) 
bectrl BO,BI,BH (LK=1) 


[POWER mnemonics: bcc, becl] 


19 BO BI //! |BH 528 LK 
0 6 11 16 |19 [21 31 


cond_ok € BOg | (CRp, = BO,) 
if cond_ok then NIA €., CTRg.g; II 0b00 
if LK then LR ¢., CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. The BH field is used as 
described in Figure 23. The branch target address is 
CTRo-¢; !!| 0600, with the high-order 32 bits of the 
branch target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


If the “decrement and test CTR” option is specified 
(BO,=0), the instruction form is invalid. 


Special Registers Altered: 
LR (if LK=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional to Count Register. 


Extended: Equivalent to: 
bectr 4,6 bectr 4,6,0 

bltctr bectr 12,0,0 
bnectr cr2 bectr 4,10,0 
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2.4.2 System Call Instruction 


This instruction provides the means by which a 
program can call upon the system to perform a 
service. 


System Call SC-form 


sc LEV 
[POWER mnemonic: svca] 


a4 //I //1 // LEV |} // | 1] / 


This instruction calls the system to perform a service. 
A complete description of this instruction can be 
found in Book Ill, PowerPC Operating Environment 
Architecture. 


The use of the LEV field is described in Book Ill. The 
value contained in the LEV field is effectively limited 
to 0 or 1, bits 0:5 of the LEV field being treated as a 
reserved field in this form. 


When control is returned to the program that exe- 
cuted the System Call instruction, the contents of the 
registers will depend on the register conventions used 
by the program providing the system service. 


This instruction is context synchronizing (see Book Ill, 
PowerPC Operating Environment Architecture). 


Special Registers Altered: 
Dependent on the system service 





Programming Note 


sc serves as both a basic and an extended mne- 
monic. The Assembler will recognize an se mne- 
monic with one operand as the basic form, and an 
sc mnemonic with no operand as the extended 
form. In the extended form the LEV operand is 
omitted and assumed to be 0. 


In application programs the value of the LEV 
operand for se should be 0. 








Compatibility Note 


For a discussion of POWER compatibility with 
respect to instruction bits 16:29, see Appendix E, 
“Incompatibilities with the POWER Architecture” 
on page 163. 
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2.4.3 Condition Register Logical Instructions 


The Condition Register Logical instructions have pre- 
ferred forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 13. In the preferred forms, the BT 
and BB fields satisfy the following rule. 


m The bit specified by BT is in the same Condition 
Register field as the bit specified by BB. 


Condition Register AND XL-form 
crand BT,BA,BB 


19 BT BA BB 257 / 
0 6 4 16 21 31 


CRpt ¢ CRpq & CRep 


The bit in the Condition Register specified by BA is 
ANDed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R Bt 


Condition Register XOR XL-form 
crxor BT,BA,BB 


19 BT BA BB 193 / 
0 6 if 16 21 31 


CRpr ¢ CRgq © CRgp 


The bit in the Condition Register specified by BA is 
XORed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R Bt 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 


ister XOR: 
Extended: Equivalent to: 
crelr Bx crxor Bx,Bx,Bx 


Extended mnemonics for Condition 
Register logical operations 


A set of extended mnemonics is provided that allow 
additional Condition Register logical operations, 
beyond those provided by the basic Condition Reg- 
ister Logical instructions, to be coded easily. Some of 
these are shown as examples with the Condition Reg- 
ister Logical instructions. See Appendix B, “Assem- 
bler Extended Mnemonics” on page 143 for additional 
extended mnemonics. 


Condition Register OR XL-form 


cror BT,BA,BB 


19 BT BA BB 449 / 
0 6 14 16 21 31 


CRet € CReaq | CRep 


The bit in the Condition Register specified by BA is 
ORed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R Bt 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 
ister OR: 


Extended: 
crmove Bx,By cror 


Condition Register NAND XL-form 


Equivalent to: 
Bx,By,By 


crnand BT,BA,BB 
19 BT BA BB 225 / 
0 6 1 16 21 31 


CRet ¢€ =(CRea & CRpp) 


The bit in the Condition Register specified by BA is 
ANDed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
R Bt 
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Condition Register NOR XL-form 


crnor BT,BA,BB 


19 BT BA BB 33 / 
0 6 11 16 21 31 


CRer € 7(CRgq | CRpp) 


The bit in the Condition Register specified by BA is 
ORed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR pt 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 


ister NOR: 
Extended: Equivalent to: 
crnot Bx,By crnor Bx,By,By 


Condition Register AND with 
Complement XL-form 


crandc BT,BA,BB 


19 BT BA BB 129 / 
0 6 14 16 21 31 


CRet ¢ CRea & 7CRpp 


The bit in the Condition Register specified by BA is 
ANDed with the complement of the bit in the Condi- 
tion Register specified by BB, and the result is placed 
into the bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR pt 


Condition Register Equivalent XL-form 


creqv BT,BA,BB 


19 BT BA BB 289 / 
0 6 11 16 21 31 


CRet ¢ CRea = CRep 


The bit in the Condition Register specified by BA is 
XORed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR pt 
Extended Mnemonics: 
Example of extended mnemonics for Condition Reg- 
ister Equivalent: 


Extended: 
crset Bx 


Equivalent to: 
creqv Bx,Bx,Bx 


Condition Register OR with Complement 
XL-form 


crorc BT,BA,BB 


19 BT BA BB 417 / 
0 6 14 16 21 31 


CRet ¢ CRea | 7CRpp 


The bit in the Condition Register specified by BA is 
ORed with the complement of the bit in the Condition 
Register specified by BB, and the result is placed into 
the bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR pt 
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2.4.4 Condition Register Field 
Instruction 


Move Condition Register Field XL-form 


merf BF,BFA 





19 BF |//| BFA}//)  /// 0 / 
0 6 9} 44 14] 16 21 31 


CRaxBr4xBF+3 © CR4,BeAaxBFA+3 


The contents of Condition Register field BFA are 
copied to Condition Register field BF. 


Special Registers Altered: 
CR field BF 
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Chapter 3. Fixed-Point Processor 


3.1 Fixed-Point Processor Overview .. 29 
3.2 Fixed-Point Processor Registers .. 29 
3.2.1 General Purpose Registers .... 29 


3.2.2 Fixed-Point Exception Register . 30 
3.3 Fixed-Point Processor Instructions 31 
3.3.1 Fixed-Point Storage Access 


Instructions ................ 31 
3.3.1.1 Storage Access Exceptions ... 31 
3.3.2 Fixed-Point Load Instructions .. 31 
3.3.3 Fixed-Point Store Instructions .. 38 
3.3.4 Fixed-Point Load and Store with 

Byte Reversal Instructions ....... 42 
3.3.5 Fixed-Point Load and Store 

Multiple Instructions ........... 44 


3.1 Fixed-Point Processor Overview 


This chapter describes the registers and instructions 
that make up the Fixed-Point Processor facility. 
Section 3.2, “Fixed-Point Processor Registers” 
describes the registers associated with the Fixed- 
Point Processor. Section 3.3, “Fixed-Point Processor 
Instructions” on page 31 describes the instructions 
associated with the Fixed-Point Processor. 


3.2 Fixed-Point Processor 
Registers 


3.3.6 Fixed-Point Move Assist 
Instructions 
3.3.7 Other Fixed-Point Instructions .. 48 
3.3.8 Fixed-Point Arithmetic Instructions 49 
3.3.9 Fixed-Point Compare Instructions 58 
3.3.10 Fixed-Point Trap Instructions .. 60 
3.3.11 Fixed-Point Logical Instructions 62 
3.3.12 Fixed-Point Rotate and Shift 
Instructions ................ 68 
3.3.12.1 Fixed-Point Rotate Instructions 68 
3.3.12.2 Fixed-Point Shift Instructions . 74 
3.3.13 Move To/From System Register 
Instructions 


3.2.1 General Purpose Registers 


All manipulation of information is done in registers 
internal to the Fixed-Point Processor. The principal 
storage internal to the Fixed-Point Processor is a set 
of 32 General Purpose Registers (GPRs). See 
Figure 24. 





Figure 24. General Purpose Registers 


Each GPR is a 64-bit register. 
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3.2.2 Fixed-Point Exception Register 


The Fixed-Point Exception Register (XER) is a 64-bit 
register. 


XER 
0 63 


Figure 25. Fixed-Point Exception Register 


The bit definitions for the Fixed-Point Exception Reg- 
ister are shown below. Here M=0 in 64-bit mode and 
M=82 in 32-bit mode. 


The bits are set based on the operation of an instruc- 
tion considered as a whole, not on intermediate 
results (e.g., the Subtract From Carrying instruction, 
the result of which is specified as the sum of three 
values, sets bits in the Fixed-Point Exception Register 
based on the entire operation, not on an intermediate 
sum). 


Bit(s) Description 
0:31 Reserved 
32 Summary Overflow (SO) 


The Summary Overflow bit is set to 1 when- 
ever an instruction (except mtspr) sets the 
Overflow bit. Once set, the SO bit remains 
set until it is cleared by an mtspr instruction 
(specifying the XER) or an merxr instruction. 
It is not altered by Compare instructions, nor 
by other instructions (except mtspr to the 
XER, and merxr) that cannot overflow. Exe- 
cuting an mtspr instruction to the XER, sup- 
plying the values 0 for SO and 1 for OV, 
causes SO to be set to 0 and OV to be set to 
1. 


33 


34 


35:56 
57:63 


Compatibility Note 


Overflow (OV) 

The Overflow bit is set to indicate that an 
overflow has occurred during execution of an 
instruction. XO-form Add, Subtract From, and 
Negate instructions having OE=1 set it to 1 if 
the carry out of bit M is not equal to the 
carry out of bit M+1, and set it to 0 other- 
wise. XO-form Multiply Low and Divide 
instructions having OE=1 set it to 1 if the 
result cannot be represented in 64 bits 
(mulld, divd, divdu) or in 32 bits (mullw, divw, 
divwu), and set it to 0 otherwise. The OV bit 
is not altered by Compare instructions, nor 
by other instructions (except mtspr to the 
XER, and merxr) that cannot overflow. 


Carry (CA) 

The Carry bit is set as follows, during exe- 
cution of certain instructions. Add Carrying, 
Subtract From Carrying, Add Extended, and 
Subtract From Extended types of instructions 
set it to 1 if there is a carry out of bit M, and 
set it to 0 otherwise. Shift Right Algebraic 
instructions set it to 1 if any 1-bits have been 
shifted out of a negative operand, and set it 
to 0 otherwise. The CA bit is not altered by 
Compare instructions, nor by © other 
instructions (except Shift Right Algebraic, 
mtspr to the XER, and merxr) that cannot 
carry. 


Reserved 


This field specifies the number of bytes to be 
transferred by a Load String Indexed or Store 
String Indexed instruction. 





For a discussion of POWER compatibility with 
respect to XER, see Appendix E, “Incompatibilities 
with the POWER Architecture” on page 163. 
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3.3 Fixed-Point Processor Instructions 


3.3.1 Fixed-Point Storage Access Instructions 


The Storage Access instructions compute the effective 
address (EA) of the storage to be accessed as 
described in Section 1.12.2, “Effective Address 
Calculation” on page 14. 








Programming Note 


The la extended mnemonic permits computing an 
effective address as a Load or Store instruction 
would, but loads the address itself into a GPR 
rather than loading the value that is in storage at 
that address. This extended mnemonic is 
described in Section B.9, “Miscellaneous 
Mnemonics” on page 153. 








3.3.2 Fixed-Point Load Instructions 


The byte, halfword, word, or doubleword in storage 
addressed by EA is loaded into register RT. 


Many of the Load instructions have an “update” form, 
in which register RA is updated with the effective 
address. For these forms, if RA#0 and RA#RT, the 
effective address is placed into register RA and the 
storage element (byte, halfword, word, or doubleword) 
addressed by EA is loaded into RT. 


In the preferred form of the Load Quadword instruc- 
tion RA # RT+1. 








Programming Note 


The DS field in DS-form Storage Access 
instructions is a word offset, not a byte offset like 
the D field in D-form Storage Access instructions. 
However, for programming convenience, Assem- 
blers should support the specification of byte 
offsets for both forms of instruction. 








3.3.1.1 Storage Access Exceptions 


Storage accesses will cause the system data storage 
error handler to be invoked if the program is not 
allowed to modify the target storage (Store only), or if 
the program attempts to access storage that is una- 
vailable. 





Programming Note 


In some implementations, the Load Algebraic and 
Load with Update instructions may have greater 
latency than other types of Load instructions. 
Moreover, Load with Update instructions may take 
longer to execute in some implementations than 
the corresponding pair of a non-update Load 
instruction and an Add instruction. 
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Load Byte and Zero D-form 


bz RT,D(RA) 


if RA = 0 then ai 0 
else € (RA) 
FA ¢b + ec, 

RT € 560 || MEM(EA, 1) 


Let the effective address (EA) be the sum (RA|0)+D. 
The byte in storage addressed by EA is loaded into 
RTs6.63- RTo-55 are set to 0. 


Special Registers Altered: 
None 


Load Byte and Zero with Update 
D-form 


Ibzu RT,D(RA) 


EA € (RA) + EXTS(D) 
RT € 560 || MEM(EA, 1) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+D. 
The byte in storage addressed by EA is loaded into 
RT56.63- RTo-55 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Byte and Zero Indexed X-form 


lbzx RT,RA,RB 


31 RT RA RB 87 / 
0 6 14 16 21 31 


if RA = 0 then a) 0 
else € (RA) 
FA ¢b + (RB) 

RT € 560 || MEM(EA, 1) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). The byte in storage addressed by EA is 
loaded into RTs¢.g3- RTo-55 are set to 0. 


Special Registers Altered: 
None 


Load Byte and Zero with Update 
Indexed X-form 


Ibzux RT,RA,RB 


31 RT RA RB 119 / 
0 6 14 16 21 31 


EA € (RA) + (RB) 
RT € 560 || MEM(EA, 1) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The byte in storage addressed by EA is loaded into 
RT56.63- RTo-55 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword and Zero D-form 


Ihz RT,D(RA) 


40 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢b + EXTS(D) 

RT € 480 || MEM(EA, 2) 


Let the effective address (EA) be the sum (RA|0)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63. RT9.47 are set to 0. 


RA) 


Special Registers Altered: 
None 


Load Halfword and Zero with Update 
D-form 


Ihzu RT,D(RA) 


EA € (RA) + EXTS(D) 
RT € 480 || MEM(EA, 2) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63- RTo-.47 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Halfword and Zero Indexed 
X-form 


Ihzx RT,RA,RB 


31 RT RA RB 279 / 
0 6 14 16 21 31 


if RA = 0 then b €0 
else b € ( 
FA ¢b + (RB) 

RT € 480 || MEM(EA, 2) 


RA) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). The halfword in storage addressed by 
EA is loaded into RT 48-63: RT 0-47 are set to 0. 


Special Registers Altered: 
None 


Load Halfword and Zero with Update 
Indexed X-form 


Ihzux RT,RA,RB 


EA € (RA) + (RB) 
RT € 480 || MEM(EA, 2) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The halfword in storage addressed by EA is loaded 
into RT4g.63- RT-47 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword Algebraic D-form 


ha RT,D(RA) 


42 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢b + EXTS(D) 

RT € EXTS(MEM(EA, 2)) 


Let the effective address (EA) be the sum (RA|0)+D. 
The halfword in storage addressed by EA is loaded 
into RT 49.63. RTp-47 are filled with a copy of bit 0 of 
the loaded halfword. 


RA) 


Special Registers Altered: 
None 


Load Halfword Algebraic with Update 
D-form 


lhau RT,D(RA) 


43 RT RA D 


EA € (RA) + EXTS(D) 
RT € EXTS(MEM(EA, 2)) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63- ~RTo.47 are filled with a copy of bit 0 of 
the loaded halfword. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword Algebraic Indexed 
X-form 


lhax RT,RA,RB 


31 RT RA RB 343 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € ( 
EA ¢b + (RB) 

RT € EXTS(MEM(EA, 2)) 


RA) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). The halfword in storage addressed by 
EA is loaded into RT4g.¢3-_ RT9-47 are filled with a copy 
of bit 0 of the loaded halfword. 


Special Registers Altered: 
None 


Load Halfword Algebraic with Update 
Indexed X-form 


Ihaux RT,RA,RB 


31 RT RA RB 375 / 


oO 
o 
= 
= 
oF 
nm 
Ls 
) 
y 


EA € (RA) + (RB) 
RT € EXTS(MEM(EA, 2)) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The halfword in storage addressed by EA is loaded 
into RT4g.63. RTo.47 are filled with a copy of bit 0 of 
the loaded halfword. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Word and Zero D-form 


lwz RT,D(RA) 
[POWER mnemonic: |] 


32 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢b + EXTS(D) 

RT € 820 || MEM(EA, 4) 


Let the effective address (EA) be the sum (RA|0)+D. 
The word in storage addressed by EA is loaded into 
RT 39.63. RT9-31 are set to 0. 


RA) 


Special Registers Altered: 
None 


Load Word and Zero with Update 
D-form 


Iwzu RT,D(RA) 
[POWER mnemonic: lu] 


33 RT RA D 
0 6 11 16 31 


EA € (RA) + EXTS(D) 
RT € 920 || MEM(EA, 4) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The word in storage addressed by EA is loaded into 
RT 39.63. RT9-3; are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Word and Zero Indexed X-form 


lwzx RT,RA,RB 
[POWER mnemonic: |x] 


31 RT RA RB 23 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
FA ¢b + (RB) 

RT € 820 || MEM(EA, 4) 


RA) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). The word in storage addressed by EA 
is loaded into RT35.63. RT9-3; are set to 0. 


Special Registers Altered: 
None 


Load Word and Zero with Update 
Indexed X-form 


Iwzux RT,RA,RB 
[POWER mnemonic: lux] 


31 RT RA RB 55 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT € 920 || MEM(EA, 4) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is loaded into 
RT 39.63. RTg-31 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Word Algebraic DS-form 


wa RT,DS(RA) 


58 RT RA DS 2 
0 6 11 16 30 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢b + EXTS(DS II 0b00) 
RT € EXTS(MEM(EA, 4)) 


Let the effective address (EA) be the sum 
(RA|0)+(DSII0b00). The word in storage addressed 
by EA is loaded into RT39.¢3._ RTo.3; are filled with a 
copy of bit 0 of the loaded word. 


Special Registers Altered: 
None 
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Load Word Algebraic Indexed X-form 


lwax RT,RA,RB 


31 RT RA RB 341 / 
0 6 14 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢b + (RB) 

RT € EXTS(MEM(EA, 4)) 


RA) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). The word in storage addressed by EA 
is loaded into RT39.63. RTo.3; are filled with a copy of 
bit 0 of the loaded word. 


Special Registers Altered: 
None 


Load Word Algebraic with Update 
Indexed X-form 


Iwaux RT,RA,RB 


31 RT RA RB 373 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT € EXTS(MEM(EA, 4)) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is loaded into 
RT39.63- RT9.3; are filled with a copy of bit 0 of the 
loaded word. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Doubleword DS-form 


Id RT,DS(RA) 


0 
30 31 


if RA = 0 then b €0 


else b € (RA) 
EA ¢b + EXTS(DS II 0b00) 
RT € MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RA|0)+(DSII0b00). The doubleword in storage 
addressed by EA is loaded into RT. 


Special Registers Altered: 
None 


Load Doubleword with Update DS-form 


Idu RT,DS(RA) 


1 
30 31 


EA € (RA) + EXTS(DS Il 0b00) 
RT ¢ MEM(EA, 8) 
RA ¢ EA 


Let the effective address (EA) be the sum 
(RA)+(DSII0b00). The doubleword in storage 
addressed by EA is loaded into RT. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Doubleword Indexed X-form 


Idx RT,RA,RB 


31 RT RA RB 21 / 
0 6 14 16 21 31 


if RA = 0 then es 0 
else ¢€ (RA) 
EA ¢b + (RB) 

RT € MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). The doubleword in storage addressed 
by EA is loaded into RT. 


Special Registers Altered: 
None 


Load Doubleword with Update Indexed 
X-form 


Idux RT,RA,RB 


31 RT RA RB 53 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT ¢ MEM(EA, 8) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The doubleword in storage addressed by EA is loaded 
into RT. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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3.3.3 Fixed-Point Store Instructions 


The contents of register RS are stored into the byte, 
halfword, word, or doubleword in storage addressed 
by EA. 


Many of the Store instructions have an “update” form, 
in which register RA is updated with the effective 
address. For these forms, the following rules apply. 


Store Byte D-form 


stb RS,D(RA) 


RS 
31 


if RA = 0 then 0 
else ¢ (RA) 
FA ¢b+ stolen 
MEM(EA, 1) € (RS) 56.63 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)56.63 are stored into the byte in storage addressed 
by EA. 


Special Registers Altered: 
None 


Store Byte with Update D-form 


stbu RS,D(RA) 


RS 
31 


EA € (RA) + EXTS(D) 
MEM(EA, 1) € (RS) 56.63 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)s¢.¢3 are stored into the byte in storage addressed 
by EA. 


EA is placed into register RA. 
lf RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


mu lf RA40, the effective address is placed into reg- 
ister RA. 


= If RS=RA, the contents of register RS are copied 
to the target storage element and then EA is 
placed into RA (RS). 


Store Byte Indexed X-form 


stbx RS,RA,RB 


31 RS RA RB 215 / 
0 6 11 16 21 31 


if RA = 0 then a 0 
else ¢€ (RA) 
EA ¢b + (RB) 

MEM(EA, 1) € (RS) 56-69 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)sg.63 are stored into the byte in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Byte with Update Indexed X-form 


stbux RS,RA,RB 


31 RS RA RB 247 / 
0 6 11 16 21 31 


EA ¢ (RA) + (RB) 
MEM(EA, 1) € (RS) 56.63 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)56.63 are stored into the byte in storage addressed 


by EA 
EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Halfword D-form 


sth RS,D(RA) 


RS 
31 


if RA = 0 then a) 0 
else € (RA) 
FA ¢b+ eeiater 
MEM(EA, 2) € (RS) ag-eg 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)4g.63 are stored into the halfword in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Halfword with Update D-form 


sthu RS,D(RA) 


RS 
31 


EA € (RA) + EXTS(D) 
MEM(EA, 2) (RS) 49-63 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)4g:63 are stored into the halfword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Halfword Indexed X-form 


sthx RS,RA,RB 


31 RS RA RB 407 / 
0 6 11 16 21 31 


if RA = 0 then es 0 
else € (RA) 
EA €b + (RB) 

MEM(EA, 2) € (RS) agg 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)4g.63 are stored into the halfword in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Halfword with Update Indexed 
X-form 


sthux RS,RA,RB 


31 RS RA RB 439 / 
0 6 11 16 21 31 


EA ¢ (RA) + (RB) 
MEM(EA, 2) (RS) 4g.69 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)4g.63 are stored into the halfword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Word D-form 


stw RS,D(RA) 
[POWER mnemonic: st] 


36 RS RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢b + EXTS(D) 
MEM(EA, 4) € (RS) 9.9 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)30.63 are stored into the word in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Word with Update D-form 


stwu RS,D(RA) 
[POWER mnemonic: stu] 


37 RS RA D 
0 6 11 16 31 


EA € (RA) + EXTS(D) 
MEM(EA, 4) (RS) 39.63 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)39.63 are stored into the word in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Word Indexed X-form 


stwx RS,RA,RB 
[POWER mnemonic: stx] 


31 RS RA RB 151 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA €b + (RB) 

MEM(EA, 4) € (RS) 30.6 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)30.¢3 are stored into the word in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Word with Update Indexed X-form 


stwux RS,RA,RB 
[POWER mnemonic: stux] 


31 RS RA RB 183 / 
0 6 11 16 21 31 


EA ¢ (RA) + (RB) 
MEM(EA, 4) € (RS) 30.63 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)30.63 are stored into the word in storage 
addressed by EA. 

EA is placed into register RA. 


lf RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Doubleword DS-form 


std RS,DS(RA) 


RS 0 
30 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢b + EXTS(DS II 0b00) 
MEM(EA, 8) (RS) 


Let the effective address (EA) be the sum 
(RA|0)+(DSII0b00). (RS) is stored into the 
doubleword in storage addressed by EA. 


Special Registers Altered: 
None 


Store Doubleword with Update DS-form 


stdu RS,DS(RA) 


RS 1 
30 31 


EA € (RA) + EXTS(DS II 0b00) 
MEM(EA, 8) € (RS) 
RA ¢ EA 


Let the effective address (EA) be the sum 
(RA)+(DSII0b00). (RS) is stored into the doubleword 
in storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Doubleword Indexed X-form 


stdx RS,RA,RB 


31 RS RA RB 149 / 
0 6 11 16 21 31 


if RA = 0 then es 0 
else € (RA) 
EA ¢€b + (RB) 

MEM(EA, 8) € (RS) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). (RS) is stored into the doubleword in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Doubleword with Update Indexed 
X-form 


stdux RS,RA,RB 


31 RS RA RB 181 / 
0 6 11 16 21 31 


FA € (RA) + (RB) 
MEM(EA, 8) € (RS) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS) is stored into the doubleword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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3.3.4 Fixed-Point Load and Store with Byte Reversal Instructions 





Programming Note 


These instructions have the effect of loading and 
storing data in Little-Endian byte order. 


In some implementations, the Load Byte-Reverse 
instructions may have greater latency than other 
Load instructions. 








Load Halfword Byte-Reverse Indexed 
X-form 


Ihbrx RT,RA,RB 


31 RT RA RB 790 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢€b + (RB) 

RT € 480 || MEM(EA+1, 1) Il MEM(EA, 1) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). Bits 0:7 of the halfword in storage 
addressed by EA are loaded into RTs¢.63. Bits 8:15 of 
the halfword in storage addressed by EA are loaded 
into RT4g-55. RTo-.47 are set to 0. 


RA) 


Special Registers Altered: 
None 


Load Word Byte-Reverse Indexed 
X-form 


Iwbrx RT,RA,RB 
[POWER mnemonic: Ibrx] 


31 RT RA RB 534 / 
0 6 11 16 21 31 


if RA = 0 then b €0 

else b ¢ ( 

FA ¢b + (RB) 

RT € 820 || MEM(EA+3, 1) Il MEM(EA+2, 1) 
Il MEM(EA+1, 1) Il MEM(EA, 1) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). Bits 0:7 of the word in storage 
addressed by EA are loaded into RTs¢.63. Bits 8:15 of 
the word in storage addressed by EA are loaded into 
RT43.55- Bits 16:23 of the word in storage addressed 
by EA are loaded into RT49.47. Bits 24:31 of the word 
in storage addressed by EA are loaded into RT49.39. 
RTo-31 are set to 0. 


RA) 


Special Registers Altered: 
None 
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Store Halfword Byte-Reverse Indexed 
X-form 


sthbrx 


31 RS RA RB 918 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢b + (RB) 

MEM(EA, 2) € (RS) 56.63 I (RS) 49:55 


RS,RA,RB 


RA) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)s¢.¢3 are stored into bits 0:7 of the 
halfword in storage addressed by EA. (RS)gg.s5 are 
stored into bits 8:15 of the halfword in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Word Byte-Reverse Indexed 
X-form 


stwbrx RS,RA,RB 
[POWER mnemonic: stbrx] 


31 RS RA RB 662 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢ ( 
EA ¢b + (RB) 

MEM(EA, 4) € (RS)sge3 Il (RS) agsg Il (RS) 4g-a7 ll (RS) 39-99 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)s¢.g3 are stored into bits 0:7 of the 
word in storage addressed by EA. (RS)4g-55 are stored 
into bits 8:15 of the word in storage addressed by EA. 
(RS)49-47 are stored into bits 16:23 of the word in 
storage addressed by EA. (RS)39.39 are stored into 
bits 24:31 of the word in storage addressed by EA. 


RA) 


Special Registers Altered: 
None 
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3.3.5 Fixed-Point Load and Store Multiple Instructions 


The Load/Store Multiple instructions have preferred 
forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 13. In the preferred forms, storage 
alignment satisfies the following rule. 


= The combination of the EA and RT (RS) is such 
that the low-order byte of GPR 31 is loaded 
(stored) from (into) the last byte of an aligned 
quadword in storage. 


Load Multiple Word D-form 


Imw RT,D(RA) 
[POWER mnemonic: Im] 


46 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢b + EXTS(D) 

vr €RT 


do while r < 31 
GPR(r) € 320 || MEM(EA, 4) 
ré¢erel 
EA ¢ EA + 4 


Let n = (82-RT). Let the effective address (EA) be 
the sum (RA|0)+D. 


n consecutive words starting at EA are loaded into 
the low-order 32 bits of GPRs RT through 31. The 
high-order 32 bits of these GPRs are set to zero. 


If RA is in the range of registers to be loaded, 
including the case in which RA=0, the instruction 
form is invalid. 


Special Registers Altered: 
None 





Compatibility Note 


For a discussion of POWER compatibility with 
respect to the alignment of the EA for the Load 
Multiple Word and _ Store Multiple Word 


instructions, see Appendix E, “Incompatibilities 
with the POWER Architecture” on page 163. For 
compatibility with future versions of the PowerPC 
Architecture, these EAs should be word-aligned. 








Store Multiple Word D-form 


stmw RS,D(RA) 
[POWER mnemonic: stm] 


47 RS RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢b + EXTS(D) 

vr €RS 


do while r < 31 
ré¢rel 
EA ¢ EA + 4 


Let n = (32-RS). Let the effective address (EA) be 
the sum (RA|0)+D. 


n consecutive words starting at EA are stored from 
the low-order 32 bits of GPRs RS through 31. 


Special Registers Altered: 
None 
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3.3.6 Fixed-Point Move Assist Instructions 


The Move Assist instructions allow movement of data 
from storage to registers or from registers to storage 
without concern for alignment. These instructions can 
be used for a short move between arbitrary storage 
locations or to initiate a long move between unaligned 
storage fields. 


The Load/Store String instructions have preferred 
forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 13. In the preferred forms, register 
usage satisfies the following rules. 


mw RS= 4or5 
mw RT= 4or5 
m last register loaded/stored < 12 


For some implementations, using GPR 4 for RS and 
RT may result in slightly faster execution than using 
GPR 5; see Book IV, PowerPC Implementation Fea- 
tures. 
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Load String Word Immediate X-form 


Iswi RT,RA,NB 
[POWER mnemonic: Isi] 


31 RT RA NB 597 / 
0 6 11 16 21 31 


if RA 0 then EA € 0 
else EA € (RA) 
if NB = 0 then n € 32 
else n €¢NB 
yr ¢RT-1 
1 € 32 
do while n > 0 
if i = 32 then 
vr ¢r +1 (mod 32) 
GPR(r) €0 
GPR(r);.;,7 ¢ MEM(EA, 1) 
1€i+ 8 
if i = 64 then i € 32 
EA ¢ EA +1 
né€n-1 


Let the effective address (EA) be (RA|O). Letn = NB 
if NB#0, n = 32 if NB=0; nis the number of bytes to 
load. Let nr = CEIL(n+4); nr is the number of regis- 
ters to receive data. 


n consecutive bytes starting at EA are loaded into 
GPRs RT through RT+nr—1. Data are loaded into the 
low-order four bytes of each GPR; the high-order four 
bytes are set to 0. 


Bytes are loaded left to right in each register. The 
sequence of registers wraps around to GPR 0 if 
required. If the low-order four bytes of register 
RT+nr—1 are only partially filled, the unfilled low- 
order byte(s) of that register are set to 0. 


If RA is in the range of registers to be loaded, 
including the case in which RA=0, the instruction 
form is invalid. 


Special Registers Altered: 
None 


Load String Word Indexed X-form 


Iswx RT,RA,RB 
[POWER mnemonic: Isx] 


31 RT RA RB 533 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € ( 
EA €b + (RB) 
n € XER57.63 
vr ¢RT-1 
i € 32 
RT € undefined 
do while n > 0 
if i = 32 then 
vr ¢r +1 (mod 32) 
GPR(r) €0 
GPR(r)j.;,7 € MEM(EA, 1) 
1¢€i+ 8 
if i = 64 then i € 32 
EA ¢€ EA +1 
né¢n-1 


RA) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). Let n = XERs7.g3; n is the number of 
bytes to load. Let nr = CEIL(n+4); nr is the number 
of registers to receive data. 


If n>0, n consecutive bytes starting at EA are loaded 
into GPRs RT through RT+nr—1. Data are loaded 
into the low-order four bytes of each GPR; the high- 
order four bytes are set to 0. 


Bytes are loaded left to right in each register. The 
sequence of registers wraps around to GPR 0 if 
required. If the low-order four bytes of register 
RT+nr—1 are only partially filled, the unfilled low- 
order byte(s) of that register are set to 0. 


If n=0, the contents of register RT are undefined. 


If RA or RB is in the range of registers to be loaded, 
including the case in which RA=0, either the system 
illegal instruction error handler is invoked or the 
results are boundedly undefined. If RT=RA _ or 
RT=RB, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store String Word Immediate X-form 


stswi RS,RA,NB 
[POWER mnemonic: stsi] 


31 RS RA NB 725 / 
0 6 11 16 21 31 


if RA = 0 then EA ¢ 0 
else EA € (RA) 
if NB = 0 then n € 32 
else n €NB 

vr €¢RS-1 

1 € 32 


do while n > 0 
if i = 32 then r ¢r +1 (mod 32) 
MEM(EA, 1) € GPR(r)j.:47 
1€i1+ 8 
if i = 64 then i € 32 
EA ¢€ EA +1 
né€n-1 


Let the effective address (EA) be (RA|O). Letn = NB 
if NB#0, n = 32 if NB=0; nis the number of bytes to 
store. Let nr = CEIL(n+4); nr is the number of regis- 
ters to supply data. 


n consecutive bytes starting at EA are stored from 
GPRs RS through RS+nr—1. Data are stored from 
the low-order four bytes of each GPR. 


Bytes are stored left to right from each register. The 
sequence of registers wraps around to GPR 0 if 
required. 


Special Registers Altered: 
None 


Store String Word Indexed X-form 


stSwx RS,RA,RB 
[POWER mnemonic: stsx] 


31 RS RA RB 661 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢€b + (RB) 
n € XER57.63 
r€¢RsS-1 
i € 32 
do while n > 0 
if i = 32 then r ¢r +1 (mod 32) 
MEM(EA, 1) € GPR(r);-i47 
1€i+ 8 
if i = 64 then i € 32 
EA ¢€ EA +1 
néen-1 


RA) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). Let n = XERs7.63; n is the number of 
bytes to store. Let nr = CEIL(n+4); nr is the number 
of registers to supply data. 


If n>0, n consecutive bytes starting at EA are stored 
from GPRs RS through RS+nr-—1. Data are stored 
from the low-order four bytes of each GPR. 


Bytes are stored left to right from each register. The 
sequence of registers wraps around to GPR 0 if 
required. 


If n=0, no bytes are stored. 


Special Registers Altered: 
None 
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3.3.7 Other Fixed-Point Instructions 


The remainder of the fixed-point instructions use the 
contents of the General Purpose Registers (GPRs) as 
source operands, and place results into GPRs, into the 
Fixed-Point Exception Register (XER), and into Condi- 
tion Register fields. In addition, the Trap instructions 
test the contents of a GPR or XER bit, invoking the 
system trap handler if the result of the specified test 
is true. 


These instructions treat the source operands as 
signed integers unless the instruction is explicitly 
identified as performing an unsigned operation. 


The X-form and XO-form instructions with Rc=1, and 
the D-form instructions addic., andi., and andis., set 
the first three bits of CR Field 0 to characterize the 


result placed into the target register. In 64-bit mode, 
these bits are set by signed comparison of the result 
to zero. In 32-bit mode, these bits are set by signed 
comparison of the low-order 32 bits of the result to 
zero. 


Unless otherwise noted and when appropriate, when 
CR Field 0 and the XER are set they reflect the value 
placed into the target register. 





Programming Note 


Instructions with the OE bit set or that set CA may 
execute slowly or may prevent the execution of 
subsequent instructions until the instruction has 
completed. 
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3.3.8 Fixed-Point Arithmetic Instructions 


The XO-form Arithmetic instructions with Rc=1, and 
the D-form Arithmetic instruction addic., set the first 
three bits of CR Field 0 as described in Section 3.3.7, 
“Other Fixed-Point Instructions” on page 48. 


addic, addic., subfic, addc, subfc, adde, subfe, addme, 
subfme, addze, and subfze always set CA, to reflect 
the carry out of bit 0 in 64-bit mode and out of bit 32 
in 32-bit mode. The XO-form Arithmetic instructions 
set SO and OV when OE=1 to reflect overflow of the 
result. Except for the Multiply Low and Divide 
instructions, the setting of these bits is mode- 
dependent, and reflects overflow of the 64-bit result in 
64-bit mode and overflow of the low-order 32-bit 
result in 32-bit mode. For XO-form Multiply Low and 
Divide instructions, the setting of these bits is mode- 
independent, and reflects overflow of the 64-bit result 
for mulld, divd, and divdu, and overflow of the low- 
order 32-bit result for mullw, divw, and divwu. 





Programming Note 


Notice that CR Field 0 may not reflect the “true” 
(infinitely precise) result if overflow occurs. 








Add Immediate D-form 


addi RT,RA,SI 
[POWER mnemonic: cal] 


14 RT RA Sl 
0 6 1 16 31 


if RA = 0 then RT ¢ EXTS(SI) 
else RT € (RA) + EXTS(SI) 


The sum (RA|O) + Sl is placed into register RT. 


Special Registers Altered: 
None 


Extended Mnemonics: 
Examples of extended mnemonics for Add Immediate: 


Extended: Equivalent to: 

li Rx,value addi Rx,0,value 

la Rx,disp(Ry) addi _ _ Rx,Ry,disp 
subi Rx,Ry,value addi Rx,Ry,—value 


Programming Note 


addi, addis, add, and subf are the preferred 
instructions for addition and subtraction, because 
they set few status bits. 


Notice that addi and addis use the value 0, not the 
contents of GPR 0, if RA=0. 








Extended mnemonics for addition and 
subtraction 


Several extended mnemonics are provided that use 
the Add Immediate and Add Immediate Shifted 
instructions to load an immediate value or an address 
into a target register. Some of these are shown as 
examples with the two instructions. 


The PowerPC Architecture supplies Subtract From 
instructions, which subtract the second operand from 
the third. A set of extended mnemonics is provided 
that use the more “normal” order, in which the third 
operand is subtracted from the second, with the third 
operand being either an immediate field or a register. 
Some of these are shown as examples with the appro- 
priate Add and Subtract From instructions. 


See Appendix B, “Assembler Extended Mnemonics” 
on page 143 for additional extended mnemonics. 


Add Immediate Shifted D-form 


addis RT,RA,SI 
[POWER mnemonic: cau] 


15 RT RA Sl 
0 6 1 16 31 


if RA = 0 then RT € EXTS(SI II 180) 
else RT € (RA) + EXTS(SI II 180) 


The sum (RA|O) + (SI Il 0x0000) is placed into reg- 
ister RT. 


Special Registers Altered: 
None 
Extended Mnemonics: 
Examples of extended mnemonics for Add Immediate 
Shifted: 
Extended: 


lis Rx,value 
subis Rx,Ry,value 


Equivalent to: 


addis Rx,0,value 
addis Rx,Ry,—value 
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Add XO-form 

add RT,RA,RB (OE=0 Rc=0) 
add. RT,RA,RB (OE=0 Re=1) 
addo RT,RA,RB (OE=1 Rc=0) 
addo. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: cax, cax., caxo, caxo.] 


31 RT RA RB |OE 266 Re 
0 6 1 16 21 | 22 31 
RT € (RA) + (RB) 
The sum (RA) + (RB) is placed into register RT. 
Special Registers Altered: 


CRO (if Re=1) 
SO OV (if OE=1) 


Add Immediate Carrying D-form 


addic RT,RA,SI 
[POWER mnemonic: ai] 


12 RT RA Sl 
0 6 11 16 31 


RT € (RA) + EXTS(SI) 
The sum (RA) + Sl is placed into register RT. 
Special Registers Altered: 
CA 
Extended Mnemonics: 
Example of extended mnemonics for Add Immediate 


Carrying: 


Extended: 
subic Rx,Ry,value 


Equivalent to: 
addic Rx,Ry,—value 


Subtract From XO-form 


subf RT,RA,RB (OE=0 Rc=0) 
subf. RT,RA,RB (OE=0 Rc=1) 
subfo RT,RA,RB (OE=1 Rc=0) 
subfo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 40 Re 
0 6 11 16 21 | 22 31 
RT € 3(RA) + (RB) + 1 
The sum =(RA) + (RB) +1 is placed into register 


RT. 


Special Registers Altered: 
CRO (if Rc=1) 
SO OV (if OE=1) 
Extended Mnemonics: 
Example of extended mnemonics for Subtract From: 


Extended: Equivalent to: 
sub Rx,Ry,Rz subf Rx,Rz,Ry 


Add Immediate Carrying and Record 
D-form 


addic. RT,RA,SI 
[POWER mnemonic: ai.] 


13 RT RA Sl 
0 6 11 16 31 


RT € (RA) + EXTS(SI) 
The sum (RA) + Sl is placed into register RT. 


Special Registers Altered: 
CRO CA 
Extended Mnemonics: 
Example of extended mnemonics for Add Immediate 
Carrying and Record: 


Extended: 
subic. Rx,Ry,value 


Equivalent to: 
addic. Rx,Ry,—value 
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Subtract From Immediate Carrying 
D-form 


subfic RT,RA,SI 
[POWER mnemonic: sfi] 


8 RT RA Sl 
0 6 11 16 31 


RT € 3(RA) + EXTS(SI) + 1 
The sum =(RA) + SI + 1 is placed into register RT. 


Special Registers Altered: 
CA 


Add Carrying XO-form 


addc RT,RA,RB (OE=0 Rc=0) 
addc. RT,RA,RB (OE=0 Rce=1) 
addco RT,RA,RB (OE=1 Rc=0) 
addco. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: a, a., ao, ao.] 


31 RT RA RB |OE 10 Re 
0 6 11 16 21 | 22 31 
RT ¢ (RA) + (RB) 


The sum (RA) + (RB) is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Carrying XO-form 


subfc RT,RA,RB (OE=0 Rc=0) 
subfc. RT,RA,RB (OE=0 Ro=1) 
subfco RT,RA,RB (OE=1 Rc=0) 
subfco. — RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: sf, sf., sfo, sfo.] 


31 RT RA RB |OE 8 Re 
0 6 11 16 21 | 22 31 
RT € =(RA) + (RB) +1 


The sum -=(RA) + (RB) + 1 is placed into register 


RT. 

Special Registers Altered: 
CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Extended Mnemonics: 
Example of extended mnemonics for Subtract From 
Carrying: 


Extended: 
subc Rx, Ry,Rz 


Equivalent to: 
subfc Rx,Rz,Ry 
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Add Extended XO-form 


adde RT,RA,RB (OE=0 Rc=0) 
adde. RT,RA,RB (OE=0 Ro=1) 
addeo RT,RA,RB (OE=1 Rc=0) 
addeo. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: ae, ae., aeo, aeo.] 


31 RT RA RB |OE 138 Re 
0 6 11 16 21 | 22 31 
RT ¢ (RA) + (RB) + CA 


The sum (RA) + (RB) + CA is placed into register 


RT. 

Special Registers Altered: 
CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Add to Minus One Extended XO-form 
addme RT,RA 


addme. RT,RA 


(OES 0 Rc=0) 
( =1) 
addmeo RT,RA (OE= 1 Ries 0) 
( ) 
0. 


addmeo. RT,RA OE=1 Rc=1 


[POWER mnemonics: ame, ame., ameo, ameo.] 


31 RT RA Ill OE 234 Re 
0 6 11 16 21 | 22 31 


RT € (RA) + CA-1 
The sum (RA) + CA+ ®1 is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Extended XO-form 


subfe RT,RA,RB (OE=0 Rc=0) 
subfe. RT,RA,RB (OE=0 Ro=1) 
subfeo RT,RA,RB (OE=1 Rc=0) 
subfeo.  RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: sfe, sfe., sfeo, sfeo.] 


31 RT RA RB |OE 136 Re 
0 6 11 16 21 | 22 31 
RT ¢€ -(RA) + (RB) + CA 


The sum =(RA) + (RB) + CA is placed into register 
RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Minus One Extended 
XO-form 


subfme RT,RA (OE=0 Rc=0) 
subfme. RT,RA (OE=0 Rc=1) 
subfmeo  RT,RA (OE=1 Rc=0) 
subfmeo. RT,RA (OE=1 Rce=1) 


[POWER mnemonics: sfme, sfme., sfmeo, sfmeo.] 


31 RT RA 11 OE 232 Re 
0 6 11 16 21 | 22 31 


RT €4(RA) + CA-1 
The sum =(RA) + CA + 41 is placed into register 
RT. 


Special Registers Altered: 
CA 


CRO 
SO OV 
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Add to Zero Extended XO-form 


addze RT,RA (OE=0 Rc=0) 
addze. RT,RA (OE=0 Rc=1) 
addzeo RT,RA (OE=1 Rc=0) 
addzeo. RT,RA (OE=1 Rc=1) 


[POWER mnemonics: aze, aze., azeo, azeo.] 


31 RT RA 11 OE 202 Re 
0 6 11 16 21 | 22 31 


RT € (RA) + CA 
The sum (RA) + CA is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Zero Extended XO-form 


subfze RT,RA (OE=0 Rc=0) 
subfze. RT,RA (OE=0 Rc=1) 
subfzeo RT,RA (OE=1 Rc=0) 
subfzeo. RT,RA (OE=1 Rc=1) 


[POWER mnemonics: sfze, sfze., sfzeo, sfzeo.] 


31 RT RA Ill \OE 200 {Re 
0 6 11 16 21 | 22 31 


RT € 3(RA) + CA 
The sum =(RA) + CA is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 





Programming Note 


The setting of CA by the Add and Subtract From 
instructions, including the Extended versions 
thereof, is mode-dependent. If a sequence of 
these instructions is used to perform extended- 
precision addition or subtraction, the same mode 
should be used throughout the sequence. 








Negate XO-form 


neg RT,RA (OE=0 Rc=0) 
neg. RT,RA (OE=0 Rc=1) 
nego RT,RA (OE=1 Rc=0) 
nego. RT,RA (OE=1 Rc=1) 

31 RT RA /// \OE 104 Re 
0 6 11 16 21 | 22 31 
RT €-=(RA) + 1 


The sum -=(RA) + 1 is placed into register RT. 


If the processor is in 64-bit mode and register RA 
contains the most negative 64-bit number (0x8000_ 
0000_0000_0000), the result is the most negative 
number and, if OE=1, OV is set to 1. Similarly, if the 
processor is in 32-bit mode and (RA)39.¢3 contain the 
most negative 32-bit number (0x8000_0000), the low- 
order 32 bits of the result contain the most negative 
32-bit number and, if OE=1, OV is set to 1. 


Special Registers Altered: 


CRO (if Re=1) 
SO OV (if OE=1) 
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Multiply Low Immediate D-form 


mulli RT,RA,SI 
[POWER mnemonic: muli] 


7 RT RA Sl 
0 6 11 16 31 


prodo-497 ¢ (RA) x EXTS (ST) 

RT € prodga4.427 

The 64-bit first operand is (RA). The 64-bit second 
operand is the sign-extended value of the SI field. 
The low-order 64 bits of the 128-bit product of the 
operands are placed into register RT. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
None 


Multiply Low Doubleword XO-form 


mulld RT,RA,RB (OE=0 Rc=0) 
mulld. RT,RA,RB (OE=0 Ro=1) 
mulldo RT,RA,RB (OE=1 Rc=0) 
mulido. — RT,RA,RB (OE=1 Re=1) 

31 RT RA RB |OE| 233 |Rc 
0 6 11 16 21 | 22 31 


prodo:497 ¢ (RA) x (RB) 
RT € prodg4.427 
The 64-bit operands are (RA) and (RB). The low-order 


64 bits of the 128-bit product of the operands are 
placed into register RT. 


If OE=1 then OV is set to 1 if the product cannot be 
represented in 64 bits. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (if Re=1) 
(if OE=1) 


SO OV 





Programming Note 


The XO-form Multiply instructions may execute 
faster on some implementations if RB contains 
the operand having the smaller absolute value. 





Multiply Low Word XO-form 


mullw RT,RA,RB (OE=0 Rc=0) 
mullw. RT,RA,RB (OE=0 Rce=1) 
mullwo RT,RA,RB (OE=1 Rc=0) 
mullwo. RT,RA,RB (OE=1 Rce=1) 


[POWER mnemonics: muls, muls., mulso, mulso.] 


31 RT | RA | RB [OE] 235 
0 6 11 16 21 | 22 31 
RT € (RA) 39.63 X (RB) 39:63 


The 32-bit operands are the low-order 32 bits of RA 
and of RB. The 64-bit product of the operands is 
placed into register RT. 


If OE=1 then OV is set to 1 if the product cannot be 
represented in 32 bits. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (if Re=1) 
(if OE=1) 





Programming Note 


For mulli and mullw, the low-order 32 bits of the 
product are the correct 32-bit product for 32-bit 
mode. 


For mulli and mulld, the low-order 64 bits of the 
product are independent of whether the operands 
are regarded as signed or unsigned 64-bit inte- 
gers. For mulli and mullw, the low-order 32 bits 
of the product are independent of whether the 
Operands are regarded as signed or unsigned 
32-bit integers. 
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Multiply High Doubleword XO-form 


mulhd RT,RA,RB 
mulhd. RT,RA,RB 


31 RT RA RB / 73 Re 
0 6 14 16 a1 | 22 31 


prodo-497 ¢ (RA) x (RB) 
RT ¢ prodg.g3 


The 64-bit operands are (RA) and (RB). The high- 
order 64 bits of the 128-bit product of the operands 
are placed into register RT. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (if Re=1) 


Multiply High Doubleword Unsigned 
XO-form 


mulhdu 
mulhdu. 


31 RT RA RB / 9 Re 
0 6 11 16 21 | 22 31 


prodo.497 ¢ (RA) x (RB) 
RT ¢ prodg.g3 


The 64-bit operands are (RA) and (RB). The high- 
order 64 bits of the 128-bit product of the operands 
are placed into register RT. 


RT,RA,RB 
RT,RA,RB 


Both operands and the product are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. 


Special Registers Altered: 
CRO (if Re=1) 


Multiply High Word XO-form 


RT,RA,RB 


mulhw = 
RT,RA,RB (Rc=1) 


mulhw. 


31 RT RA RB / 75 Re 
0 6 14 16 a1 | 22 31 


prodg.e3 € (RA) 39.63 x (RB) 39:63 
RT 39-63 € prodg,s1 
RTo.3; € undefined 


The 32-bit operands are the low-order 32 bits of RA 
and of RB. The high-order 32 bits of the 64-bit 
product of the operands are placed into RT39.¢3._ The 
contents of RTo.3, are undefined. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 


Multiply High Word Unsigned XO-form 


mulhwu RT,RA,RB (Rce=0) 
mulhwu. RT,RA,RB (Rc=1) 

31 RT RA RB / 11 Re 
0 6 11 16 21 | 22 31 


prodo.eg ¢ (RA) 32.93 x (RB) 39:63 
RT 39.63 € Prody.34 
RTo.3; € undefined 


The 32-bit operands are the low-order 32 bits of RA 
and of RB. The high-order 32 bits of the 64-bit 
product of the operands are placed into RT39.g3. The 
contents of RTo.3,; are undefined. 


Both operands and the product are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
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Divide Doubleword XO-form 


divd RT,RA,RB (OE=0 Rc=0) 
divd. RT,RA,RB (OE=0 Rc=1) 
divdo RT,RA,RB (OE=1 Rc=0) 
divdo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 489 Re 
0 6 i 16 21 | 22 31 


dividendo.g3 ¢ (RA) 
divisoro.g3 ¢ (RB) 
RT € dividend + divisor 


The 64-bit dividend is (RA). The 64-bit divisor is (RB). 
The 64-bit quotient of the dividend and divisor is 
placed into register RT. The remainder is not sup- 
plied as a result. 


Both operands and the quotient are interpreted as 
signed integers. The quotient is the unique signed 
integer that satisfies 


dividend = (quotient x divisor) + r 


where O< r< |divisor| if the dividend is nonnegative, 
and — |divisor| < r< 0 if the dividend is negative. 


If an attempt is made to perform any of the divisions 


0x8000 0000 0000 0000 + -1 
<anything> + 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In these cases, if OE=1 then OV is set to 
a 


Special Registers Altered: 
CRO (if Re=1) 
SO OV (if OE=1) 





Programming Note 


The 64-bit signed remainder of dividing (RA) by 
(RB) can be computed as follows, except in the 
case that (RA) = — 26 and (RB) = —1. 


divd RT,RA,RB # RT = quotient 
mulld RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 








Divide Word XO-form 


divw RT,RA,RB (OE=0 Rc=0) 
divw. RT,RA,RB (OE=0 Rc=1) 
divwo RT,RA,RB (OE=1 Re=0) 
divwo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 491 Re 
0 6 11 16 21 | 22 31 


dividend .g3 ¢ EXTS ( (RA) 30:63) 
divisory.¢63 ¢ EXTS ( (RB) 32:63) 

RT 39.63 ¢ dividend = divisor 
RT9-34 ¢€ undefined 


The 64-bit dividend is the sign-extended value of 
(RA)30.¢63- The 64-bit divisor is the sign-extended 
value of (RB)30.63. The 64-bit quotient is formed. The 
low-order 32 bits of the 64-bit quotient are placed into 
RT39.g3. The contents of RTo.3; are undefined. The 
remainder is not supplied as a result. 


Both operands and the quotient are interpreted as 
signed integers. The quotient is the unique signed 
integer that satisfies 


dividend = (quotient x divisor) + r 


where O< r< |divisor| if the dividend is nonnegative, 
and — |divisor| < r< O if the dividend is negative. 


If an attempt is made to perform any of the divisions 


0x8000_0000 + —1 
<anything> = 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In these cases, if OE=1 then OV is set to 
a 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
SO OV (if OE=1) 


Programming Note 


The 32-bit signed remainder of dividing (RA)s39-63 
by (RB)3o.63 can be computed as follows, except in 
the case that (RA)39.63 = — 23! and (RB)g9.63 = —1. 


divw RT,RA,RB # RT = quotient 
mullw RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 
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Divide Doubleword Unsigned XO-form 


divdu RT,RA,RB (OE=0 Rc=0) 
divdu. RT,RA,RB (OE=0 Rce=1) 
divduo RT,RA,RB (OE=1 Rc=0) 
divduo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 457 Re 
0 6 14 16 21 | 22 31 


dividendo.g3 ¢ (RA) 
divisoro.g3 ¢ (RB) 
RT € dividend + divisor 


The 64-bit dividend is (RA). The 64-bit divisor is (RB). 
The 64-bit quotient of the dividend and divisor is 
placed into register RT. The remainder is not sup- 
plied as a result. 


Both operands and the quotient are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. The quotient is the unique unsigned 
integer that satisfies 


dividend = (quotient x divisor) + r 


where 0O< r< divisor. 


If an attempt is made to perform the division 
<anything> = 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In this case, if OE=1 then OV is set to 1. 


Special Registers Altered: 
CRO (if Re=1) 
SO OV (if OE=1) 


Programming Note 


The 64-bit unsigned remainder of dividing (RA) by 
(RB) can be computed as follows. 


divdu RT,RA,RB # RT = quotient 


mulld RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 











Divide Word Unsigned XO-form 


divwu RT,RA,RB (OE=0 Rc=0) 
divwu. RT,RA,RB (OE=0 Rc=1) 
divwuo RT,RA,RB (OE=1 Rc=0) 
divwuo. RT,RA,RB (OE=1 Rc=1) 


31 RT RA RB |OE 459 Re 
0 6 1 16 21 | 22 31 


dividend .g3 ¢ 329 Il (RA) 30-63 
divisorg.g¢3 ¢ 329 II (RB) 39-63 

RT 30.63 ¢ dividend = divisor 
RTo-31 € undefined 


The 64-bit dividend is the zero-extended value of 
(RA)30.63- The 64-bit divisor is the zero-extended 
value of (RB)30.63. The 64-bit quotient is formed. The 
low-order 32 bits of the 64-bit quotient are placed into 
RT39.63. The contents of RTo.3; are undefined. The 
remainder is not supplied as a result. 


Both operands and the quotient are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. The quotient is the unique unsigned 
integer that satisfies 


dividend = (quotient x divisor) + r 


where 0< r< divisor. 


If an attempt is made to perform the division 
<anything> = 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In this case, if OE=1 then OV is set to 1. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
SO OV (if OE=1) 





Programming Note 


The 32-bit unsigned remainder of dividing 
(RA)39.63 by (RB)30.63 can be computed as follows. 


divwu RT,RA,RB # RT = quotient 
mullw RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 
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3.3.9 Fixed-Point Compare Instructions 


The fixed-point Compare instructions compare the 
contents of register RA with (1) the sign-extended 
value of the SI field, (2) the zero-extended value of 
the UI field, or (3) the contents of register RB. The 
comparison is signed for cmpi and cmp, and unsigned 
for cmpli and empl. 


The L field controls whether the operands are treated 
as 64-bit or 32-bit quantities, as follows: 


L Operand length 
0 32-bit operands 
1 64-bit operands 


When the operands are treated as 32-bit signed quan- 
tities, bit 32 of the register (RA or RB) is the sign bit. 


The Compare instructions set one bit in the leftmost 
three bits of the designated CR field to 1, and the 
other two to 0. XERgg is copied to bit 3 of the desig- 
nated CR field. 


Compare Immediate D-form 


cmpi BF,L,RA,SI 


; 
0 6 9nd 11 16 31 


if L = 0 thena ¢ EXTS ( (RA) 39.63) 


else a ¢ (RA) 
if a < EXTS(SI) then c € 0b100 
else if a > EXTS(SI) then c € 0b010 
else c € 0b001 


CR4xBF:4xBF+3 € ¢ Il XERgo 


The contents of register RA ((RA)39.¢3 sign-extended 
to 64 bits if L=0) are compared with the sign- 
extended value of the SI field, treating the operands 
as signed integers. The result of the comparison is 
placed into CR field BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare Imme- 
diate: 


Extended: Equivalent to: 
cmpdi_ Rx,value cmpi_ 0,1,Rx,value 
cmpwi_ cr3,Rx,value cmpi 3,0,Rx,value 


The CR field is set as follows. 
Description 
(RA) < Sl or (RB) ( 
(RA) € Ul or (RB) ( 
1 GT (RA) > Sl or (RB) (signed comparison) 
( ( 
(R 


Bit Name 
0 LT signed comparison) 


unsigned comparison) 


RA) > Ul or (RB) (unsigned comparison) 
A) = Sl, UI, or (RB) 
Summary Overflow from the XER 


EQ 
3 SO 


Extended mnemonics for compares 


A set of extended mnemonics is provided so that 
compares can be coded with the operand length as 
part of the mnemonic rather than as a numeric 
operand. Some of these are shown as examples with 
the Compare instructions. See Appendix B, “Assem- 
bler Extended Mnemonics” on page 143 for additional 
extended mnemonics. 


Compare X-form 


BF,L,RA,RB 


Sn aaa a ! 
11 31 


if L = 0 thena ¢ EXTS ( (RA) 39.63) 
b € EXTS ( (RB) a9¢9) 


else a ¢ (RA) 

b € (RB) 
if a < b then c € 0b100 
else if a > b then c € 0b010 
else c € 0b001 


CRaxBF:4xBF+3 ¢ ¢ Il XERgo 


The contents of register RA ((RA)30.63 if L=0) are 
compared with the contents of register RB ((RB)39.¢3 if 
L=0), treating the operands as signed integers. The 
result of the comparison is placed into CR field BF. 


Special Registers Altered: 
CR field BF 


Extended Mnemonics: 
Examples of extended mnemonics for Compare: 


Extended: Equivalent to: 
cmpd_Rx,Ry cmp _0,1,Rx,Ry 
cmpw__cr3,Rx,Ry cmp — 3,0,Rx,Ry 
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Compare Logical Immediate D-form 


cmpli BF,L,RA,UI 


10 BF j/jL} RA Ul 
0 6 |9g11 16 31 


if L = 0 then a ¢ %0 Il (RA) 3963 


else a ¢ (RA) 
if a 2 (489 Il UI) then c € 0b100 
else if a 5 (480 Il UI) then c € 0b010 
else c € 0b001 


CR4xBF:4xBF+3 € ¢ Il XERgo 


The contents of register RA (RA) 32:69 zero-extended 
to 64 bits if L=0) are compared with “°0 || Ul, treating 
the operands as unsigned integers. The result of the 
comparison is placed into CR field BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare 
Logical Immediate: 


Extended: Equivalent to: 
cmpldi Rx,value cmpli 0,1,Rx,value 
cmplwi cr3,Rx,value cmpli 3,0,Rx,value 


Compare Logical X-form 


cmpl BF,L,RA,RB 


31 BF |/iL} RA RB 32 / 
0 6 j9ngt1 16 21 31 


if L = 0 then a ¢ %0 Il (RA) 3963 
b € 220 Il (RB) 39.63 


else a ¢ (RA) 

b ¢ (RB) 
if a <b then c € 0b100 
else if a > b then c ¢ 0b010 
else c € 0b001 


CRaxBF:4xBF+3 ¢ ¢ Il XERgo 


The contents of register RA ((RA)3..63 if L=0) are 
compared with the contents of register RB ((RB)39.g3 if 
L=0), treating the operands as unsigned integers. 
The result of the comparison is placed into CR field 
BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare 
Logical: 


Extended: Equivalent to: 
cmpld Rx,Ry cmpl  0,1,Rx,Ry 
cmplw_ cr3,Rx,Ry cmpl 3,0,Rx,Ry 
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3.3.10 Fixed-Point Trap Instructions 


The Trap instructions are provided to test for a speci- 
fied set of conditions. If any of the conditions tested 
by a Trap instruction are met, the system trap handler 
is invoked. If none of the tested conditions are met, 
instruction execution continues normally. 


The contents of register RA are compared with either 
the sign-extended value of the SI field or the contents 
of register RB, depending on the Trap instruction. For 
tdi and td, the entire contents of RA (and RB) partic- 
ipate in the comparison; for twi and tw, only the con- 
tents of the low-order 32 bits of RA (and RB) 
participate in the comparison. 


This comparison results in five conditions which are 


ANDed with TO. If the result is not 0 the system trap 
handler is invoked. These conditions are as follows. 


Trap Doubleword Immediate D-form 


tdi TO,RA,SI 
2 TO RA Sl 
0 6 11 16 31 
a € (RA) 
if (a < EXTS(SI)) & TOg then TRAP 
if (a > EXTS(SI)) & TO, then TRAP 
if (a = EXTS(SI)) & TO» then TRAP 
if (a < EXTS(SI)) & TO then TRAP 
if (a 5 EXTS(SI)) & TO, then TRAP 


The contents of register RA are compared with the 
sign-extended value of the SI field. If any bit in the 
TO field is set to 1 and its corresponding condition is 
met by the result of the comparison, the system trap 
handler is invoked. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended 
Doubleword Immediate: 


mnemonics for Trap 


Extended: Equivalent to: 
tdlti Rx,value tdi 16,Rx,value 
tdnei Rx,value tdi 24,Rx,value 


TO Bit ANDed with Condition 

0 Less Than, using signed comparison 

1 Greater Than, using signed comparison 

2 Equal 

3 Less Than, using unsigned comparison 

4 Greater Than, using unsigned comparison 


Extended mnemonics for traps 


A set of extended mnemonics is provided so that 
traps can be coded with the condition as part of the 
mnemonic rather than as a numeric operand. Some 
of these are shown as examples with the Trap 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Trap Word Immediate D-form 


twi TO,RA,SI 
[POWER mnemonic: ti] 


3 TO RA Sl 
0 6 11 16 31 


if (a < EXTS(SI)) & TO) then TRAP 
if (a > EXTS(SI)) & TO, then TRAP 
if (a = EXTS(SI)) & TO» then TRAP 
if (a < EXTS(SI)) & TO3 then TRAP 
if (a 5 EXTS(SI)) & TO, then TRAP 


The contents of RAgo.g3 are compared with the sign- 
extended value of the Sl field. If any bit in the TO 
field is set to 1 and its corresponding condition is met 
by the result of the comparison, the system trap 
handler is invoked. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended mnemonics for Trap Word 
Immediate: 


Extended: Equivalent to: 
twgti Rx,value twi 8,Rx,value 
twllei Rx,value twi 6,Rx,value 
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Trap Doubleword X-form 


td TO,RA,RB 
31 TO RA RB 68 / 
0 6 i 16 21 31 
a € (RA) 
b ¢ (RB) 
if (a < b) & TO) then TRAP 
if (a > b) & TO; then TRAP 
if (a = b) & TOs then TRAP 
if (a <b) & TOg then TRAP 
if (a 5 b) & TO, then TRAP 


The contents of register RA are compared with the 
contents of register RB. If any bit in the TO field is 
set to 1 and its corresponding condition is met by the 
result of the comparison, the system trap handler is 
invoked. 


Special Registers Altered: 
None 


Extended Mnemonics: 


Examples of extended mnemonics’ for Trap 
Doubleword: 

Extended: Equivalent to: 

tdge Rx,Ry td 12,Rx,Ry 

tdinl Rx,Ry td 5,Rx,Ry 


Trap Word X-form 


tw TO,RA,RB 
[POWER mnemonic: t] 


31 TO RA RB 4 / 
0 6 11 16 21 31 


b ¢ EXTS((RB) 39.9) 

if (a < b) & TO then TRAP 
if (a > b) & TO; then TRAP 
if (a = b) & TOs then TRAP 
if (a <b) & TOg then TRAP 
if (a 5b) & TO, then TRAP 


The contents of RAgo.g3 are compared with the con- 
tents of RB3o.g3._ If any bit in the TO field is set to 1 
and its corresponding condition is met by the result of 
the comparison, the system trap handler is invoked. 


Special Registers Altered: 
None 


Extended Mnemonics: 
Examples of extended mnemonics for Trap Word: 


Extended: Equivalent to: 
tweq  Rx,Ry tw 4,Rx,Ry 
twlge Rx,Ry tw 5,Rx,Ry 
trap tw 31,0,0 
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3.3.11 Fixed-Point Logical Instructions 


The Logical instructions perform bit-parallel oper- 
ations on 64-bit operands. 


The X-form Logical instructions with Rc=1, and the 
D-form Logical instructions andi. and andis., set the 
first three bits of CR Field 0 as described in Section 
3.3.7, “Other Fixed-Point Instructions” on page 48. 
The Logical instructions do not change the SO, OV, 
and CA bits in the XER. 


AND Immediate D-form 


andi. RA,RS,UI 
[POWER mnemonic: andil.] 


28 RS RA Ul 
0 6 11 16 31 


RA € (RS) & (480 II UZ) 
The contents of register RS are ANDed with 480 | UI 
and the result is placed into register RA. 


Special Registers Altered: 
CRO 


Extended mnemonics for logical 
operations 


An extended mnemonic is provided that generates the 
preferred form of “no-op” (an instruction that does 
nothing). This is shown as an example with the OR 
Immediate instruction. 


Extended mnemonics are provided that use the OR 
and NOR instructions to copy the contents of one reg- 
ister to another, with and without complementing. 
These are shown as examples with the two 
instructions. 


See Appendix B, “Assembler Extended Mnemonics” 
on page 143 for additional extended mnemonics. 


AND Immediate Shifted D-form 


andis. RA,RS,UI 
[POWER mnemonic: andiu.] 


29 RS RA Ul 
0 6 11 16 31 


RA € (RS) & (340 Il UI II 180) 
The contents of register RS are ANDed with °20 || UI 
11 180 and the result is placed into register RA. 


Special Registers Altered: 
CRO 
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OR Immediate D-form 


ori RA,RS,UI 
[POWER mnemonic: oril] 


24 RS RA Ul 
0 6 11 16 31 


RA € (RS) | (480 II UI) 
The contents of register RS are ORed with 480 || UI 
and the result is placed into register RA. 
The preferred “no-op” (an instruction that does 
nothing) is: 
ori 0,0,0 
Special Registers Altered: 
None 
Extended Mnemonics: 
Example of extended mnemonics for OR Immediate: 


Extended: Equivalent to: 
nop ori 0,0,0 


XOR Immediate D-form 


xori RA,RS,UI 
[POWER mnemonic: xoril] 


26 RS RA Ul 
0 6 11 16 31 


RA ¢ (RS) ® (480 II UI) 
The contents of register RS are XORed with 480 II UI 
and the result is placed into register RA. 


Special Registers Altered: 
None 


OR Immediate Shifted D-form 


oris RA,RS,UI 
[POWER mnemonic: oriu] 


25 RS RA Ul 
0 6 11 16 31 


RA ¢ (RS) | (320 Il UI II 180) 


The contents of register RS are ORed with 220 || UI II 
169 and the result is placed into register RA. 


Special Registers Altered: 
None 


XOR Immediate Shifted D-form 


xoris RA,RS,UI 
[POWER mnemonic: xoriu] 


27 RS RA Ul 
0 6 11 16 31 


RA € (RS) © (320 Il UI II 180) 
The contents of register RS are XORed with °20 II UI 
11 160 and the result is placed into register RA. 


Special Registers Altered: 
None 
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AND X-form 
and RA,RS,RB (Rc=0) 
and. RA,RS,RB (Ro=1) 


31 RS RA RB 28 Re 
0 6 11 16 21 31 


RA € (RS) & (RB) 


The contents of register RS are ANDed with the con- 
tents of register RB and the result is placed into reg- 


ister RA. 
Special Registers Altered: 

CRO (if Re=1) 
XOR X-form 
xor RA,RS,RB (Rc=0) 
xor. RA,RS,RB (Rc=1) 


31 RS RA RB 316 Re 
0 6 11 16 21 31 


RA € (RS) ® (RB) 


The contents of register RS are XORed with the con- 
tents of register RB and the result is placed into reg- 


ister RA. 


Special Registers Altered: 
CRO (if Re=1) 


OR X-form 
or RA,RS,RB (Rc=0) 
or. RA,RS,RB (Ro=1) 


31 RS RA RB 444 Re 
0 6 11 16 21 31 


RA € (RS) | (RB) 


The contents of register RS are ORed with the con- 
tents of register RB and the result is placed into reg- 
ister RA. 


Special Registers Altered: 
CRO (if Re=1) 


Extended Mnemonics: 
Example of extended mnemonics for OR: 


Extended: Equivalent to: 

mr Rx,Ry or Rx,Ry,Ry 
NAND X-form 
nand RA,RS,RB (Rc=0) 
nand. RA,RS,RB (Rc=1) 


31 RS RA RB 476 Re 
0 6 11 16 21 31 


RA € 3((RS) & (RB)) 


The contents of register RS are ANDed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 


Programming Note 











nand or nor with RS=RB can be used to obtain 
the one's complement. 
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NOR X-form 
nor RA,RS,RB (Rc=0) 
nor. RA,RS,RB (Rc=1) 


31 RS RA RB 124 Re 
0 6 11 16 21 31 


RA € 3((RS) | (RB)) 


The contents of register RS are ORed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 


Extended Mnemonics: 
Example of extended mnemonics for NOR: 


Extended: 
not Rx,Ry nor 


Equivalent to: 
Rx,Ry,Ry 


AND with Complement X-form 


andc RA,RS,RB 
andc. RA,RS,RB 


(Rc=0) 
(Rce=1) 


31 RS RA RB 60 Re 
0 6 11 16 21 31 


RA ¢ (RS) & (RB) 


The contents of register RS are ANDed with the com- 
plement of the contents of register RB and the result 
is placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 


Equivalent X-form 


eqv RA,RS,RB (Rc=0) 
eqv. RA,RS,RB (Rc=1) 

31 RS RA RB 284 Re 
0 6 11 16 21 31 
RA € (RS) = (RB) 


The contents of register RS are XORed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 


OR with Complement X-form 


orc RA,RS,RB 
ore. RA,RS,RB 


31 RS RA RB 412 Re 
0 6 11 16 21 31 


RA € (RS) | 7(RB) 


(Rc=0) 
(Rce=1) 


The contents of register RS are ORed with the com- 
plement of the contents of register RB and the result 
is placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 
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Extend Sign Byte X-form 


extsb RA,RS (Rc=0) 

extsb. RA,RS (Rc=1) 
31 RS RA /// 954 Re 

0 6 1 16 24 31 

S € (RS) 56 

RAgg:63 €_ (RS) 56-63 


(RS)sg-63 are placed into RAsg.e3. Bit 56 of register RS 
is placed into RAg 55. 


Special Registers Altered: 
CRO (if Re=1) 


Extend Sign Word X-form 


extsw RA,RS (Rc=0) 

extsw. RA,RS (Rc=1) 
31 RS RA //I 986 Re 

0 6 11 16 21 31 

S € (RS) 35 

RAgo-63 ¢ (RS) 32:63 

RAg31 ¢ %s 


(RS)30.63 are placed into RA39.¢3. Bit 32 of register RS 
is placed into RAp3;. 


Special Registers Altered: 
CRO (if Re=1) 


Extend Sign Halfword X-form 


extsh RA,RS 
extsh. RA,RS 


[POWER mnemonics: exts, exts.] 


31 RS RA /// 922 Re 
0 6 11 16 21 31 


S € (RS) 4g 

RAgge3 ¢ (RS) ag:63 

RAg.a7 ¢ “8s 

(RS)4g-63 are placed into RAgg.63. Bit 48 of register RS 
is placed into RAo.47. 


Special Registers Altered: 
CRO (if Re=1) 
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Count Leading Zeros Doubleword 
X-form 


cntlzd RA,RS (Rc=0) 
entlzd. RA,RS (Rc=1) 

31 RS RA /// 58 Re 
0 6 11 16 21 31 
n¢0 


do while n < 64 
if (RS), = 1 then leave 
néenil 

RA €n 


A count of the number of consecutive zero bits 
starting at bit 0 of register RS is placed into RA. This 
number ranges from 0 to 64, inclusive. 


If Rc=1, CR Field 0 is set to reflect the result. 


Special Registers Altered: 
CRO (if Re=1) 


Count Leading Zeros Word X-form 


RA,RS 
RA,RS 


[POWER mnemonics: entlz, cntlz.] 


31 RS RA //I 26 Re 
0 6 11 16 21 31 


n € 32 

do while n < 64 
if (RS), = 1 then leave 
néenil 

RA €n — 32 


cntlzw 
entlzw. 


A count of the number of consecutive zero bits 
starting at bit 32 of register RS is placed into RA. 
This number ranges from 0 to 32, inclusive. 


lf Rc=1, CR Field 0 is set to reflect the result. 


Special Registers Altered: 
CRO (if Re=1) 





Programming Note 


For both Count Leading Zeros instructions, if 
Rc=1 then LT is set to 0 in CR Field 0. 
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3.3.12 Fixed-Point Rotate and Shift Instructions 


The Fixed-Point Processor performs rotation oper- 
ations on data from a GPR and returns the result, or a 
portion of the result, to a GPR. 


The rotation operations rotate a 64-bit quantity left by 
a specified number of bit positions. Bits that exit from 
position O enter at position 63. 


Two types of rotation operation are supported. 


For the first type, denoted rotateg, or ROTL¢4, the 
value rotated is the given 64-bit value. The rotateg, 
operation is used to rotate a given 64-bit quantity. 


For the second type, denoted rotate35 or ROTLgs, the 
value rotated consists of two copies of bits 32:63 of 
the given 64-bit value, one copy in bits 0:31 and the 
other in bits 32:63. The rotate, operation is used to 
rotate a given 32-bit quantity. 


The Rotate and Shift instructions employ a mask gen- 
erator. The mask is 64 bits long, and consists of 
1-bits from a start bit, mstart, through and including a 
stop bit, mstop, and O-bits elsewhere. The values of 
mstart and mstop range from 0 to 63. If mstart > 
mstop, the 1-bits wrap around from position 63 to 
position 0. Thus the mask is formed as follows: 


if mstart < mstop then 
ones 
zeros 


maskmstart:mstop 


maskai! other bits 
else 


Maskmstart:63 = Ones 
MaSko-mstop = ones 
Maskail other bits = Z€4OS 


There is no way to specify an all-zero mask. 


For instructions that use the rotate3. operation, the 
mask start and stop positions are always in the low- 
order 32 bits of the mask. 


The use of the mask is described in following 
sections. 


The Rotate and Shift instructions with Rc=1 set the 
first three bits of CR field 0 as described in Section 
3.3.7, “Other Fixed-Point Instructions” on page 48. 
Rotate and Shift instructions do not change the OV 
and SO bits. Rotate and Shift instructions, except 
algebraic right shifts, do not change the CA bit. 


Extended mnemonics for rotates and 
shifts 


The Rotate and Shift instructions, while powerful, can 
be complicated to code (they have up to five oper- 
ands). A set of extended mnemonics is provided that 
allow simpler coding of often-used functions such as 
clearing the leftmost or rightmost bits of a register, 
left justifying or right justifying an arbitrary field, and 
performing simple rotates and shifts. Some of these 
are shown as examples with the Aotate instructions. 
See Appendix B, “Assembler Extended Mnemonics” 
on page 143 for additional extended mnemonics. 


3.3.12.1 Fixed-Point Rotate Instructions 


These instructions rotate the contents of a register. 
The result of the rotation is 


m inserted into the target register under control of a 
mask (if a mask bit is 1 the associated bit of the 
rotated data is placed into the target register, 
and if the mask bit is 0 the associated bit in the 
target register remains unchanged); or 


= ANDed with a mask before being placed into the 
target register. 


The Rotate Left instructions allow right-rotation of the 
contents of a register to be performed (in concept) by 
a left-rotation of 64—n, where n is the number of bits 
by which to rotate right. They allow right-rotation of 
the contents of the low-order 32 bits of a register to 
be performed (in concept) by a left-rotation of 32—n, 
where n is the number of bits by which to rotate right. 
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Rotate Left Doubleword Immediate then 
Clear Left MD-form 


ridicl RA,RS,SH,MB 
ridicl. RA,RS,SH,MB 


30 RS RA sh mb 0 IshiRa 
0 6 11 16 21 27 |30)31 


n ¢ shs Il sho-4 

r € ROTLe4((RS), n) 
b € Tbs, ll Tb9-4 

m € MASK(b, 63) 
RA¢ré&m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 63 and O-bits elsewhere. The rotated data 
are ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear Left: 


Extended: Equivalent to: 

extrdi Rx,Ry,n,b ridicl Rx,Ry,o+n,64—n 
srdi Rx,Ry,n ridicl Rx,Ry,64—n,n 
clridi Rx,Ry,n ridicl Rx,Ry,0,n 


Programming Note 


Rotate Left Doubleword Immediate then 
Clear Right MD-form 


ridicr RA,RS,SH,ME 
ridicr. RA,RS,SH,ME 


30 RS RA sh me 1 |shiRa 
0 6 1 16 21 27 |30)31 


n € she ll sho-4 

r € ROTLe4((RS), n) 
e € mes Il meo.4 

m € MASK(0, e) 
RA¢ré&m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit 0 through 
bit ME and 0-bits elsewhere. The rotated data are 
ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear Right: 


Extended: Equivalent to: 

extldi Rx,Ry,n,b ridicr Rx,Ry,b,n— 1 
sldi Rx,Ry,n ridicr Rx,Ry,n,63—n 
clrrdi Rx,Ry,n rildicr Rx,Ry,0,63—n 





rldicl can be used to extract an n-bit field that 
starts at bit position b in register RS, right- 
justified into register RA (clearing the remaining 
64—n bits of RA), by setting SH=b+n and 
MB=64-n. It can be used to rotate the contents 
of a register left (right) by n bits, by setting SH=n 
(64—n) and MB=0. It can be used to shift the 
contents of a register right by n bits, by setting 
SH=64-—n and MBe=n. It can be used to clear the 
high-order n bits of a register, by setting SH=0 
and MB=n. 


Extended mnemonics are provided for all of these 
uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 








Programming Note 





rldicr can be used to extract an n-bit field that 
starts at bit position b in register RS, left-justified 
into register RA (clearing the remaining 64—n bits 
of RA), by setting SH=b and ME=n-1. It can be 
used to rotate the contents of a register left 
(right) by n bits, by setting SH=n (64—n) and 
ME=63. It can be used to shift the contents of a 
register left by n bits, by setting SH=n and 
ME=63-n. It can be used to clear the low-order 
n bits of a register, by setting SH=0 and 
ME=63-n. 


Extended mnemonics are provided for all of these 
uses (some devolve to fridicl); see Appendix B, 
“Assembler Extended Mnemonics” on page 143. 
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Rotate Left Doubleword Immediate then 
Clear MD-form 


ridic RA,RS,SH,MB 
ridic. RA,RS,SH,MB 


30 RS RA sh mb 2 IshiRa 
0 6 11 16 21 27 |30)31 


n ¢ shs Il sho-4 

1 € ROTLg,4( (RS), n) 
b ¢€ Tbs Il Tb9-4 

m € MASK(b, -n) 
RA¢ré&m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 68— SH and 0-bits elsewhere. The rotated 
data are ANDed with the generated mask and the 
result is placed into register RA. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear. 


Extended: Equivalent to: 
clrisidi Rx,Ry,b,n ridic Rx,Ry,n,b—n 





Programming Note 


rldic can be used to clear the high-order b bits of 
the contents of a register and then shift the result 
left by n bits, by setting SH=n and MB=b-n. It 
can be used to clear the high-order n bits of a 
register, by setting SH=0 and MBen. 


Extended mnemonics are provided for both of 
these uses (the second devolves to ridicl); see 
Appendix B, “Assembler Extended Mnemonics” 
on page 143. 








Rotate Left Word Immediate then AND 
with Mask M-form 


rlwinm RA,RS,SH,MB,ME (Re=0) 
rlwinm. RA,RS,SH,MB,ME (Rc=1) 
[POWER mnemonics: rlinm, rlinm.] 

21 RS RA SH MB ME |Rc 
0 6 11 16 21 26 31 
n ¢ SH 


r¢ ROTLigo ( (RS) 39-63, n) 
m ¢ MASK(MB+32, ME+32) 
RA#rém 


The contents of register RS are rotated, left SH bits. 
A mask is generated having 1-bits from bit MB+32 
through bit ME+32 and 0-bits elsewhere. The rotated 
data are ANDed with the generated mask and the 
result is placed into register RA. 


Special Registers Altered: 
CRO (if Re=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Word Immediate then AND with Mask: 


Extended: 

extlwi Rx,Ry,n,b 
srwi Rx,Ry,n 
clrrwi Rx,Ry,n 


Equivalent to: 

rlwinm Rx,Ry,b,0,n— 1 
rlwinm Rx,Ry,32-n,n,31 
rlwinm Rx,Ry,0,0,31-1n 
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Programming Note 


Let RSL represent the low-order 32 bits of reg- 
ister RS, with the bits numbered from 0 through 
31. 


rlwinm can be used to extract an n-bit field that 
starts at bit position b in RSL, right-justified into 
the low-order 32 bits of register RA (clearing the 
remaining 32—n bits of the low-order 32 bits of 
RA), by setting SH=b+n, MB=32-n, and 
ME=31. It can be used to extract an n-bit field 
that starts at bit position b in RSL, left-justified 
into the low-order 32 bits of register RA (clearing 
the remaining 32—n bits of the low-order 32 bits of 
RA), by setting SH=b, MB = 0, and ME=n-1. It 
can be used to rotate the contents of the low- 
order 32 bits of a register left (right) by n bits, by 
setting SH=n (82—n), MB=0, and ME=31. It can 
be used to shift the contents of the low-order 32 
bits of a register right by n bits, by setting 
SH=32-n, MB=n, and ME=31. It can be used to 
clear the high-order b bits of the low-order 32 bits 
of the contents of a register and then shift the 
result left by n bits, by setting SH=n, MB=b—n, 
and ME=31-n. It can be used to clear the low- 
order n bits of the low-order 32 bits of a register, 
by setting SH=0, MB=0, and ME=31-n. 


For all the uses given above, the high-order 32 
bits of register RA are cleared. 


Extended mnemonics are provided for all of these 
uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 








Rotate Left Doubleword then Clear Left 
MDS-form 


ridcl RA,RS,RB,MB (Rc=0) 
ridel. RA,RS,RB,MB (Ro=1) 


30 RS RA RB mb 8 |Rc 
0 6 1 16 21 27 | 34 


n € (RB) 59.63 

1 € ROTLe4( (RS), 1) 
b¢ Tbs ll Tmb9-4 

m ¢ MASK(b, 63) 
RA¢ré&m 


The contents of register RS are rotatedg, left the 
number of bits specified by (RB)sg.g3. A mask is gen- 
erated having 1-bits from bit MB through bit 63 and 
0-bits elsewhere. The rotated data are ANDed with 
the generated mask and the result is placed into reg- 
ister RA. 


Special Registers Altered: 

CRO (if Re=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left 
Doubleword then Clear Lett: 


Extended: Equivalent to: 
rotld Rx,Ry,Rz ridcl Rx,Ry,Rz,0 








Programming Note 


tldcl can be used to extract an n-bit field that 
starts at variable bit position b in register RS, 
right-justified into register RA (clearing the 
remaining 64-n bits of RA), by setting 
RBsg.g3=b +n and MB=64—n. It can be used to 
rotate the contents of a register left (right) by var- 
iable n bits, by setting RBsg.gg=n (64—n) and 
MB=0. 


Extended mnemonics are provided for some of 
these uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 
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Rotate Left Doubleword then Clear Right 
MDS-form 


rider RA,RS,RB,ME 
rider. RA,RS,RB,ME 


30 RS RA RB me 9 |Rc 
0 6 11 16 21 27 «(| 31 


n € (RB) 59.63 

r € ROTL¢4((RS), n) 
e € mes ll MEo-4 

m € MASK(0, e) 
RA¢ré&m 


The contents of register RS are rotatedg, left the 
number of bits specified by (RB)sg.g3. A mask is gen- 
erated having 1-bits from bit 0 through bit ME and 
0-bits elsewhere. The rotated data are ANDed with 
the generated mask and the result is placed into reg- 
ister RA. 


Special Registers Altered: 
CRO (if Re=1) 





Programming Note 


rldcr can be used to extract an n-bit field that 
starts at variable bit position b in register RS, left- 
justified into register RA (clearing the remaining 
64-n bits of RA), by setting RBsge3=b and 
ME=n-1. It can be used to rotate the contents of 
a register left (right) by variable n bits, by setting 
RBs¢.63= n (64— n) and ME=63. 


Extended mnemonics are provided for some of 
these uses (some devolve to. ridcl); see 
Appendix B, “Assembler Extended Mnemonics” 
on page 143. 








Rotate Left Word then AND with Mask 
M-form 


rlwnm RA,RS,RB,MB,ME (Rc=0) 
rlwnm. RA,RS,RB,MB,ME (Rc=1) 
[POWER mnemonics: rinm, rinm.] 

23 RS RA RB MB ME |Rc 
0 6 11 16 21 26 31 
n € (RB) 59:63 


r¢ ROTLigo ( (RS) 39.63, n) 
m ¢ MASK(MB+32, ME+32) 
RAf#rém 


The contents of register RS are rotateds. left the 
number of bits specified by (RB)sg9.63. A mask is gen- 
erated having 1-bits from bit MB+32 through bit 
ME+32 and O-bits elsewhere. The rotated data are 
ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left Word 
then AND with Mask: 


Extended: 
rotlw  Rx,Ry,Rz 


Equivalent to: 
rlwnm Rx,Ry,Rz,0,31 





Programming Note 


Let RSL represent the low-order 32 bits of reg- 
ister RS, with the bits numbered from 0 through 
31. 


rlwnm can be used to extract an n-bit field that 
starts at variable bit position b in RSL, right- 
justified into the low-order 32 bits of register RA 
(clearing the remaining 32—n bits of the low-order 
32 bits of RA), by setting RBsg¢g3=b+n, 
MB=32-n, and ME=31. It can be used to extract 
an n-bit field that starts at variable bit position b 
in RSL, left-justified into the low-order 32 bits of 
register RA (clearing the remaining 32—n bits of 
the low-order 32 bits of RA), by setting RBsg-¢3=b, 
MB = 0, and ME=n-1. It can be used to rotate 
the contents of the low-order 32 bits of a register 
left (right) by variable n bits, by setting RBsg.g3= n 
(32—n), MB=0, and ME=31. 


For all the uses given above, the high-order 32 
bits of register RA are cleared. 


Extended mnemonics are provided for some of 
these uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 
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Rotate Left Doubleword Immediate then 
Mask Insert MD-form 


rldimi 
rldimi. 


RA,RS,SH,MB 
RA,RS,SH,MB 


30 RS RA sh mb 3 IshiRa 
0 6 11 16 21 27 |30]31 


n¢ shs Il sho-4 
r¢ ROTLg4( (RS) , n) 
b € Ts, ll Tb9-4 
m € MASK(b, -n) 
RA € r&ém | (RA) &m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 63— SH and 0-bits elsewhere. The rotated 
data are inserted into register RA under control of the 
generated mask. 


Special Registers Altered: 
CRO (if Re=1) 
Extended Mnemonics: 


Example of extended mnemonics for Rotate Left 
Doubleword Immediate then Mask Insert: 


Extended: 
insrdi Rx,Ry,n,b 


Equivalent to: 
ridimi Rx,Ry,64—(b+n),b 


Programming Note 


rldimi can be used to insert an n-bit field that is 
right-justified in register RS, into register RA 
starting at bit position b, by _ setting 
SH=64-(b+n) and MB=b. 


An extended mnemonic is provided for this use; see 
Appendix B, “Assembler Extended Mnemonics” on 
page 143. 








Rotate Left Word Immediate then Mask 
Insert M-form 


rlwimi RA,RS,SH,MB,ME (Rc=0) 
rlwimi. RA,RS,SH,MB,ME (Rc=1) 
[POWER mnemonics: rlimi, rlimi.] 

20 RS RA SH MB ME |Rc 
0 6 11 16 21 26 31 
n ¢ SH 


r¢ ROTLigo ( (RS) 39-63, n) 
m ¢ MASK(MB+32, ME+32) 
RA € r&ém | (RA) &m 


The contents of register RS are rotated, left SH bits. 
A mask is generated having 1-bits from bit MB+32 
through bit ME+32 and 0-bits elsewhere. The rotated 
data are inserted into register RA under control of the 
generated mask. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left Word 
Immediate then Mask Insert: 


Extended: 
inslwi Rx,Ry,n,b 


Equivalent to: 
rlwimi Rx,Ry,32-b,b,b+n-1 





Programming Note 


Let RAL represent the low-order 32 bits of reg- 
ister RA, with the bits numbered from 0 through 
31. 


rlwimi can be used to insert an n-bit field that is 
left-justified in the low-order 32 bits of register 
RS, into RAL starting at bit position b, by setting 
SH=32-b, MB=b, and ME=(b+n)-1. It can be 
used to insert an n-bit field that is right-justified in 
the low-order 32 bits of register RS, into RAL 
starting at bit position b, by — setting 
SH=32-(b+n), MB=b, and ME=(b+n)-1. 


Extended mnemonics are provided for both of 
these uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 
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3.3.12.2 Fixed-Point Shift Instructions 


The instructions in this section perform left and right 
shifts. 


Extended mnemonics for shifts 


Immediate-form logical (unsigned) shift operations are 
obtained by specifying appropriate masks and shift 
values for certain Rotate instructions. A set of 
extended mnemonics is provided to make coding of 
such shifts simpler and easier to understand. Some 
of these are shown as examples with the Aotate 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Shift Left Doubleword X-form 


sid RA,RS,RB (Rc=0) 
sid. RA,RS,RB (Rc=1) 

31 RS RA RB 27 Re 
0 6 11 16 21 31 


1 € ROTLe4( (RS), n) 
if (RB)s7 = 0 then 

m €¢ MASK(0, 63-n) 
else m ¢ 40 
RA¢ré&m 


The contents of register RS are shifted left the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 0 are lost. Zeros are supplied to the 
vacated positions on the right. The result is placed 
into register RA. Shift amounts from 64 to 127 give a 
zero result. 


Special Registers Altered: 
CRO (if Re=1) 





Programming Note 


Any Shift Right Algebraic instruction, followed by 
addze, can be used to divide quickly by 2". The 
setting of the CA bit by the Shift Right Algebraic 
instructions is independent of mode. 








Programming Note 


Multiple-precision shifts can be programmed as 
shown in Section C.1, “Multiple-Precision Shifts” 
on page 155. 








Shift Left Word X-form 


slw RA,RS,RB 
slw. RA,RS,RB 


[POWER mnemonics: sl, sl.] 


31 RS RA RB 24 Re 
0 6 11 16 21 31 


n € (RB) 59:63 
r¢ ROTLigo ( (RS) 39-63, n) 
if (RB)sg = 0 then 

m ¢ MASK(32, 63-n) 
else m ¢ &0 
RA¢ré&m 


The contents of the low-order 32 bits of register RS 
are shifted left the number of bits specified by 
(RB)sg.63- Bits shifted out of position 32 are lost. 
Zeros are supplied to the vacated positions on the 
right. The 32-bit result is placed into RA3o.63. RAg-31 
are set to zero. Shift amounts from 32 to 63 give a 
zero result. 


Special Registers Altered: 
CRO (if Re=1) 
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Shift Right Doubleword X-form 


std RA,RS,RB (Ro=0) 
srd. RA,RS,RB (Rc=1) 

31 RS RA RB 539 Re 
0 6 1" 16 21 31 
n € (RB) 59.63 


r € ROTLg4((RS), 64—n) 
if (RB) 57 = 0 then 

m ¢ MASK(n, 63) 
else m € 60 
RA¢ré&m 


The contents of register RS are shifted right the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 63 are lost. Zeros are supplied to the 
vacated positions on the left. The result is placed into 
register RA. Shift amounts from 64 to 127 give a zero 
result. 


Special Registers Altered: 
CRO (if Re=1) 


Shift Right Word X-form 


srw RA,RS,RB 
srw. RA,RS,RB 


[POWER mnemonics: sr, sr.] 


31 RS RA RB 536 Re 
0 6 11 16 21 31 


n ¢ (RB) 59:63 
r¢ ROTLigo ( (RS) 39.63, 64—n) 
if (RB)sg = 0 then 

m € MASK(n+32, 63) 
else m ¢ 0 
RA¢ré&m 


The contents of the low-order 32 bits of register RS 
are shifted right the number of bits specified by 
(RB)sg.63- Bits shifted out of position 63 are lost. 
Zeros are supplied to the vacated positions on the 
left. The 32-bit result is placed into RAgo.63. RAg-31 
are set to zero. Shift amounts from 32 to 63 give a 
zero result. 


Special Registers Altered: 
CRO (if Re=1) 
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Shift Right Algebraic Doubleword 
Immediate XS-form 


sradi RA,RS,SH (Rc=0) 
sradi. RA,RS,SH (Rce=1) 


31 RS RA sh 413 shiRc 
0 6 1 16 21 30] 31 


n ¢€ shs ll sho-4 

r¢ ROTLgq( (RS) , 64—n) 
m € MASK(n, 63) 

s € (RS)o 

RA € rém | (84s) eam 
CA ¢€s & ((r&-m) #0) 


The contents of register RS are shifted right SH bits. 
Bits shifted out of position 63 are lost. Bit 0 of RS is 
replicated to fill the vacated positions on the left. The 
result is placed into register RA. CA is set to 1 if (RS) 
is negative and any 1-bits are shifted out of position 
63; otherwise CA is set to 0. A shift amount of zero 
causes RA to be set equal to (RS), and CA to be set 
to 0. 


Special Registers Altered: 
CA 
CRO (if Re=1) 


Shift Right Algebraic Word Immediate 
X-form 


srawi RA,RS,SH (Rc=0) 
srawi. RA,RS,SH (Rc=1) 
[POWER mnemonics: srai, srai.] 

31 RS RA SH 824 Re 
0 6 11 16 21 31 


n ¢ SH 

r¢ ROTLigo ( (RS) 39.63, 64—n) 
m ¢ MASK(n+32, 63) 

s € (RS)3 

RA € rém 4 (845) & om 

CA ¢s & ( (x&-m) 30.690) 


The contents of the low-order 32 bits of register RS 
are shifted right SH bits. Bits shifted out of position 
63 are lost. Bit 32 of RS is replicated to fill the 
vacated positions on the left. The 32-bit result is 
placed into RA39.¢3. Bit 32 of RS is replicated to fill 
RAo:31- CA is set to 1 if the low-order 32 bits of (RS) 
contain a negative number and any 1-bits are shifted 
out of position 63; otherwise CA is set to 0. A shift 
amount of zero causes RA to receive EXTS((RS)39.¢3), 
and CA to be set to 0. 


Special Registers Altered: 
CA 
CRO (if Re=1) 
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Shift Right Algebraic Doubleword 
X-form 


srad RA,RS,RB (Ro=0) 
srad. RA.RS,RB (Ro=1) 

31 RS RA RB 794 Re 
0 6 4 16 21 31 
n ¢ (RB) 5:63 


r ¢ ROTLgq( (RS) , 64—n) 
if (RB)57 = 0 then 

m € MASK(n, 63) 
else m ¢ 640 
s € (RS)o 
RA € rém | (84s) g-m 
CA ¢s & ((r&-Mm) 40) 


The contents of register RS are shifted right the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 63 are lost. Bit 0 of RS is replicated to fill 
the vacated positions on the left. The result is placed 
into register RA. CA is set to 1 if (RS) is negative and 
any 1-bits are shifted out of position 63; otherwise CA 
is set to 0. A shift amount of zero causes RA to be 
set equal to (RS), and CA to be set to 0. Shift 
amounts from 64 to 127 give a result of 64 sign bits in 
RA, and cause CA to receive the sign bit of (RS). 


Special Registers Altered: 
CA 
CRO (if Re=1) 


Shift Right Algebraic Word X-form 


sraw RA,RS,RB 
sraw. RA,RS,RB 


[POWER mnemonics: sra, sra.] 


31 RS RA RB 792 Re 
0 6 11 16 21 31 


n € (RB) 59:63 
r¢ ROTLigo ( (RS) 39.63, 64—n) 
if (RB)sg = 0 then 
m € MASK(n+32, 63) 
else m ¢ 0 
S € (RS) 35 
RA € rém | (84s) om 
CA ¢s & ((r&-M) 39.6340) 


The contents of the low-order 32 bits of register RS 
are shifted right the number of bits specified by 
(RB)sg.63- Bits shifted out of position 63 are lost. Bit 
32 of RS is replicated to fill the vacated positions on 
the left. The 32-bit result is placed into RAgo.g3. Bit 
32 of RS is replicated to fill RAo.3;. CA is set to 1 if 
the low-order 32 bits of (RS) contain a negative 
number and any 1-bits are shifted out of position 63; 
otherwise CA is set to 0. A shift amount of zero 
causes RA to receive EXTS((RS)39.63), and CA to be 
set to 0. Shift amounts from 32 to 63 give a result of 
64 sign bits, and cause CA to receive the sign bit of 
(RS)30-63: 


Special Registers Altered: 
CA 
CRO (if Rc=1) 
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3.3.13 Move To/From System Register Instructions 


The Move To Condition Register Fields instruction has 
a preferred form; see Section 1.9.1, “Preferred 
Instruction Forms” on page 13. In the preferred form, 
the FXM field satisfies the following rule. 


m Exactly one bit of the FXM field is set to 1. 


Extended mnemonics 


Extended mnemonics are provided for the mtspr and 
mfspr instructions so that they can be coded with the 


Move To Special Purpose Register 
XFX-form 


mtspr SPR,RS 


0 6 11 21 31 


n € sprs-g Il Spro-4 

if length(SPREG(n)) = 64 then 
SPREG(n) (RS) 

else 


The SPR field denotes a Special Purpose Register, 
encoded as shown in the table below. The contents of 
register RS are placed into the designated Special 
Purpose Register. For Special Purpose Registers that 
are 32 bits long, the low-order 32 bits of RS are 
placed into the SPR. 


SPR’ Register 
SPI5:9 SPlo:4 Name 


00000 00001 XER 
00000 01000 LR 


decimal 


00000 01001 CTR 


“Note that the order of the two 5-bit 
halves of the SPR number is reversed. 





If the SPR field contains any value other than one of 
the values shown above then one of the following 
occurs. 


= The system illegal instruction error handler is 
invoked. 

m= The system privileged instruction error handler is 
invoked. 

m The results are boundedly undefined. 


A complete description of this instruction can be 
found in Book Ill, PowerPC Operating Environment 
Architecture. 


Special Registers Altered: 
See above 








SPR name as part of the mnemonic rather than as a 
numeric operand. An extended mnemonic is provided 
for the mterf instruction for compatibility with old soft- 
ware (written for a version of the architecture that 
precedes Version 2.00) that uses it to set the entire 
Condition Register. Some of these extended mne- 
monics are shown as examples with the relevant 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Extended Mnemonics: 


Examples of extended mnemonics for Move To 
Special Purpose Register: 


Extended: Equivalent to: 
mtxer Rx mtspr 1,Rx 
mitlr Rx mtspr 8,Rx 
mtctr Rx mtspr 9,Rx 


Compiler and Assembler Note 


For the mtspr and mfspr instructions, the SPR 
number coded in assembler language does not 
appear directly as a 10-bit binary number in the 
instruction. The number coded is split into two 
5-bit halves that are reversed in the instruction, 
with the high-order 5 bits appearing in bits 16:20 
of the instruction and the low-order 5 bits in bits 
11:15. This maintains compatibility with POWER 
SPR encodings, in which these two instructions 
have only a 5-bit SPR field occupying bits 11:15. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to SPR numbers not shown in the instruc- 
tion descriptions for mtspr and mfspr, see 
Appendix E, “Incompatibilities with the POWER 
Architecture” on page 163. 
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Move From Special Purpose Register 


XFX-form 
mfspr RT,SPR 

31 RT spr 339 / 
0 6 11 21 31 


n € sprs-g Il Spro-4 

if length(SPREG(n)) = 64 then 
RT € SPREG(n) 

else 
RT € 320 || SPREG(n) 


The SPR field denotes a Special Purpose Register, 
encoded as shown in the table below. The contents of 
the designated Special Purpose Register are placed 
into register RT. For Special Purpose Registers that 
are 32 bits long, the low-order 32 bits of RT receive 
the contents of the Special Purpose Register and the 
high-order 32 bits of RT are set to zero. 


SPR’ Register 
decimal = Spfs-9 Spro-4 Name 


00000 00001 XER 
00000 01000 LR 
00000 01001 CTR 


“Note that the order of the two 5-bit 
halves of the SPR number is reversed. 





If the SPR field contains any value other than one of 
the values shown above then one of the following 


occurs. 

= The system illegal instruction error handler is 
invoked. 

= The system privileged instruction error handler is 
invoked. 


m= The results are boundedly undefined. 


A complete description of this instruction can be 
found in Book Ill, PowerPC Operating Environment 
Architecture. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended mnemonics for Move From 
Special Purpose Register. 


Extended: Equivalent to: 
mfxer Rx mfspr Rx,1 
mflr Rx mfspr Rx,8 
mfctr Rx mfspr Rx,9 


Note 
See the Notes that appear with mtspr. 
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Move To Condition Register Fields 
XFX-form 


mtcrf FXM,RS 


31 RS |0 FXM / 144 / 
0 6 11] 12 20] 21 31 


mask € 4(FXMp) Il 4(FXM,) Il... 4(FXMz) 
CR € ((RS)g9.gg & mask) | (CR & ->mask) 


The contents of bits 32:63 of register RS are placed 
into the Condition Register under control of the field 
mask specified by FXM. The field mask identifies the 
4-bit fields affected. Let i be an integer in the range 
0-7. If FXMj=1 then CR field i (CR bits 4xi:4xi+3) is 
set to the contents of the corresponding field of the 
low-order 32 bits of RS. 


Special Registers Altered: 
CR fields selected by mask 


Extended Mnemonics: 


Example of extended mnemonics for Move To Condi- 
tion Register Fields: 


Extended: Equivalent to: 
mtcr Rx mtcrf OxFF,Rx 


Programming Note 





In the preferred form of this instruction (see the 
introduction to Section 3.3.13), only one Condition 
Register field is updated. 








Move From Condition Register 
XFX-form 


mfcr RT 

31 RT |0 II 19 / 
0 6 11] 12 21 31 
RT € 320 Il CR 


The contents of the Condition Register are placed into 
RT39.63- RTo-31 are set to 0. 


Special Registers Altered: 
None 
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4.1 Floating-Point Processor 
Overview 


This chapter describes the registers and instructions 
that make up the Floating-Point Processor facility. 
Section 4.2, “Floating-Point Processor Registers” on 
page 82 describes the registers associated with the 
Floating-Point Processor. Section 4.6, “Floating-Point 
Processor Instructions” on page 97 describes the 
instructions associated with the Floating-Point 
Processor. 


4.4.5.1 Definition 
4.4.5.2 Action 
4.5 Floating-Point Execution Models .. 94 
4.5.1 Execution Model for IEEE 


Operations ................ 95 
4.5.2 Execution Model for Multiply-Add 

Type Instructions ............. 96 
4.6 Floating-Point Processor 

Instructions ................ 97 
4.6.1 Floating-Point Storage Access 

Instructions ................ 97 


4.6.1.1 Storage Access Exceptions ... 97 
4.6.2 Floating-Point Load Instructions . 97 
4.6.3 Floating-Point Store Instructions 100 
4.6.4 Floating-Point Move Instructions 104 
4.6.5 Floating-Point Arithmetic 


Instructions ............... 105 
4.6.5.1 Floating-Point Elementary 

Arithmetic Instructions ........ 105 
4.6.5.2 Floating-Point Multiply-Add 

Instructions ............... 107 


4.6.6 Floating-Point Rounding and 
Conversion Instructions ........ 109 
4.6.7 Floating-Point Compare 


Instructions ............... 113 
4.6.8 Floating-Point Status and Control 
Register Instructions ......... 114 


This architecture specifies that the processor imple- 
ment a floating-point system as defined in ANSI/IEEE 
Standard 754-1985, “IEEE Standard for Binary 
Floating-Point Arithmetic” (hereafter referred to as 
“the IEEE standard”), but requires software support in 
order to conform fully with that standard. That 
standard defines certain required “operations” (addi- 
tion, subtraction, etc.); the term “floating-point opera- 
tion” is used in this chapter to refer to one of these 
required operations, or to the operation performed by 
one of the Multiply-Add or Reciprocal Estimate 
instructions. All floating-point operations conform to 
that standard, except if software sets the Floating- 
Point Non-IEEE Mode (NI) bit in the Floating-Point 
Status and Control Register to 1 (see page 84), in 
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which case floating-point operations do not neces- 
sarily conform to that standard. 


Instructions are provided to perform arithmetic, 
rounding, conversion, comparison, and other oper- 
ations in floating-point registers; to move floating- 
point data between storage and these registers; and 
to manipulate the Floating-Point Status and Control 
Register explicitly. 


These instructions are divided into two categories. 


= computational instructions 

The computational instructions are those that 
perform addition, subtraction, multiplication, divi- 
sion, extracting the square root, rounding, con- 
version, Comparison, and combinations of these 
operations. These instructions provide the float- 
ing-point operations. They place status informa- 
tion into the Floating-Point Status and Control 
Register. They are the instructions described in 
Sections 4.6.5 through 4.6.7 and Section 5.2.1. 


= non-computational instructions 

The non-computational instructions are those that 
perform loads and stores, move the contents of a 
floating-point register to another floating-point 
register possibly altering the sign, manipulate the 
Floating-Point Status and Control Register explic- 
itly, and select the value from one of two float- 
ing-point registers based on the value in a third 
floating-point register. The operations performed 
by these instructions are not considered floating- 
point operations. With the exception of the 
instructions that manipulate the Floating-Point 
Status and Control Register explicitly, they do not 
alter the Floating-Point Status and Control Reg- 
ister. They are the instructions described in 
Sections 4.6.2 through 4.6.4, 4.6.8, and 5.2.2. 


A floating-point number consists of a signed exponent 
and a signed significand. The quantity expressed by 
this number is the product of the significand and the 
number 2¢xPonent| Encodings are provided in the data 
format to represent finite numeric values, + Infinity, 
and values that are “Not a Number” (NaN). Oper- 
ations involving infinities produce results obeying tra- 
ditional mathematical conventions. NaNs have no 
mathematical interpretation. Their encoding permits 
a variable diagnostic information field. They may be 
used to indicate such things as uninitialized variables 
and can be produced by certain invalid operations. 


There is one class of exceptional events that occur 
during instruction execution that is unique to the 
Floating-Point Processor: the Floating-Point Exception. 
Floating-point exceptions are signaled with bits set in 
the Floating-Point Status and Control Register 
(FPSCR). They can cause the system floating-point 
enabled exception error handler to be invoked, pre- 
cisely or imprecisely, if the proper control bits are set. 


Floating-Point Exceptions 


The following floating-point exceptions are detected 
by the processor: 


a Invalid Operation Exception (VX) 
SNaN (VXSNAN) 
Infinity— Infinity (VXISI) 
Infinity= Infinity (VXIDI) 
Zero+ Zero (VXZDZ) 
Infinityx Zero (VXIMZ) 
Invalid Compare (VXVC) 
Software Request (VXSOFT) 
Invalid Square Root (VXSQRT) 
Invalid Integer Convert (VXCVI) 

m= Zero Divide Exception (ZX) 

= Overflow Exception (OX) 

a Underflow Exception (UX) 

m Inexact Exception (XX) 


Each floating-point exception, and each category of 
Invalid Operation Exception, has an exception bit in 
the FPSCR. In addition, each floating-point exception 
has a corresponding enable bit in the FPSCR. See 
Section 4.2.2, “Floating-Point Status and Control 
Register” on page 83 for a description of these 
exception and enable bits, and Section 4.4, “Floating- 
Point Exceptions” on page 89 for a detailed dis- 
cussion of floating-point exceptions, including the 
effects of the enable bits. 


4.2 Floating-Point Processor 
Registers 


4.2.1 Floating-Point Registers 


Implementations of this architecture provide 32 float- 
ing-point registers (FPRs). The floating-point instruc- 
tion formats provide 5-bit fields for specifying the 
FPRs to be used in the execution of the instruction. 
The FPRs are numbered 0-31. See Figure 26 on 
page 83. 


Each FPR contains 64 bits that support the floating- 
point double format. Every instruction that interprets 
the contents of an FPR as a floating-point value uses 
the floating-point double format for this interpretation. 


The computational instructions, and the Move and 
Select instructions, operate on data located in FPRs 
and, with the exception of the Compare instructions, 
place the result value into an FPR and optionally 
place status information into the Condition Register. 


Load Double and Store Double instructions are pro- 
vided that transfer 64 bits of data between storage 
and the FPRs with no conversion. Load Single 
instructions are provided to transfer and convert 
floating-point values in floating-point single format 
from storage to the same value in floating-point 
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double format in the FPRs. Store Single instructions 
are provided to transfer and convert floating-point 
values in floating-point double format from the FPRs 
to the same value in floating-point single format in 
storage. 


Instructions are provided that manipulate’ the 
Floating-Point Status and Control Register and the 
Condition Register explicitly. Some of these 
instructions copy data from an FPR to the Floating- 
Point Status and Control Register or vice versa. 


The computational instructions and the Select instruc- 
tion accept values from the FPRs in double format. 
For single-precision arithmetic instructions, all input 
values must be representable in single format; if they 
are not, the result placed into the target FPR, and the 
setting of status bits in the FPSCR and in the Condi- 
tion Register (if Rc=1), are undefined. 


FPR 0 
FPR 1 





FPR 30 
FPR 31 
0 63 


Figure 26. Floating-Point Registers 


4.2.2 Floating-Point Status and 
Control Register 


The Floating-Point Status and Control Register 
(FPSCR) controls the handling of floating-point 
exceptions and records status resulting from the float- 
ing-point operations. Bits 0:23 are status bits. Bits 
24:31 are control bits. 


The exception bits in the FPSCR (bits 3:12, 21:23) are 
sticky; that is, once set to 1 they remain set to 1 until 
they are set to 0 by an merfs, mifsfi, mtfsf, or mtfsb0 
instruction. The exception summary bits in the FPSCR 
(FX, FEX, and VX, which are bits 0:2) are not consid- 
ered to be “exception bits”, and only FX is sticky. 


FEX and VX are simply the ORs of other FPSCR bits. 
Therefore these two bits are not listed among the 
FPSCR bits affected by the various instructions. 


FPSCR 
0 31 


Figure 27. Floating-Point Status and Control Register 


The bit definitions for the FPSCR are as follows. 


Bit(s) Description 


0 


Floating-Point Exception Summary (FX) 

Every floating-point instruction, except mtfsfi 
and mtfsf, implicitly sets FPSCR-ey to 1 if that 
instruction causes any of the floating-point 
exception bits in the FPSCR to change from 0 to 
1. merfs, mtfsfi, mtfsf, mtfsb0, and mtfsb1 can 
alter FPSCRey explicitly. 


Floating-Point Enabled Exception Summary 
(FEX) 

This bit is the OR of all the floating-point excep- 
tion bits masked by their respective enable bits. 
merfs, mtfsfi, mtfsf, mtfsb0, and mtfsb1 cannot 
alter FPSCRrex explicitly. 


Floating-Point 
Summary (VX) 
This bit is the OR of all the Invalid Operation 
exception bits. merfs, mtfsfi, mtfsf, mtfsb0, and 
mtfsb1 cannot alter FPSCRyy explicitly. 


Invalid Operation Exception 


Floating-Point Overflow Exception (OX) 
See Section 4.4.3, “Overflow Exception” on 
page 93. 


Floating-Point Underflow Exception (UX) 
See Section 4.4.4, “Underflow Exception” on 
page 93. 


Floating-Point Zero Divide Exception (ZX) 
See Section 4.4.2, “Zero Divide Exception” on 
page 92. 


Floating-Point Inexact Exception (XX) 
See Section 4.4.5, “Inexact Exception” on 
page 94. 


FPSCRyy is a sticky version of FPSCRr, (see 
below). Thus the following rules completely 
describe how FPSCRy, is set by a given instruc- 
tion. 


m If the instruction affects FPSCR-,, the new 
value of FPSCRyy is obtained by ORing the 
old value of FPSCRxy with the new value of 
FPSCRe}. 

m lf the instruction does not affect FPSCR», 
the value of FPSCRx is unchanged. 


Floating-Point Invalid Operation 
(SNaN) (VXSNAN) 

See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


Floating-Point 
(co — 0 ) (VXISI) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point 
(co +00 ) (VXIDI) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 
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10 


11 


12 


13 


14 


Floating-Point 
(0= 0) (VXZDZ) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point 
(0 x0) (VXIMZ) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point Invalid Operation 
(Invalid Compare) (VXVC) 

See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


Floating-Point Fraction Rounded (FR) 

The last Arithmetic or Rounding and Conversion 
instruction incremented the fraction during 
rounding. See Section 4.3.6, “Rounding” on 
page 89. This bit is not sticky. 


Floating-Point Fraction Inexact (Fl) 

The last Arithmetic or Rounding and Conversion 
instruction either produced an inexact result 
during rounding or caused a disabled Overflow 
Exception. See Section 4.3.6, “Rounding” on 
page 89. This bit is not sticky. 


See the definition of FPSCRyx, above, regarding 
the relationship between FPSCR-, and FPSCRxx. 


15:19 Floating-Point Result Flags (FPRF) 


15 


This field is set as described below. For arith- 
metic, rounding, and conversion instructions, 
the field is set based on the result placed into 
the target register, except that if any portion of 
the result is undefined then the value placed 
into FPRF is undefined. 


Floating-Point Result Class Descriptor (C) 
Arithmetic, rounding, and conversion 
instructions may set this bit with the FPCC bits, 
to indicate the class of the result as shown in 
Figure 28 on page 85. 


16:19 Floating-Point Condition Code (FPCC) 


16 
17 


18 
19 
20 


Floating-point Compare instructions set one of 
the FPCC bits to 1 and the other three FPCC 
bits to 0. Arithmetic, rounding, and conversion 
instructions may set the FPCC bits with the C 
bit, to indicate the class of the result as shown 
in Figure 28 on page 85. Note that in this case 
the high-order three bits of the FPCC retain 
their relational significance indicating that the 
value is less than, greater than, or equal to 
zero. 


Floating-Point Less Than or Negative (FL or <) 


Floating-Point Greater Than or Positive (FG or 
>) 


Floating-Point Equal or Zero (FE or =) 
Floating-Point Unordered or NaN (FU or ?) 


Reserved 


21 


22 


23 


24 


25 


26 


27 


28 


29 


Floating-Point Invalid Operation 
(Software Request) (VXSOFT) 
This bit can be altered only by merfs, mtfsfi, 
mtfsf, mtfsb0, or mtfsb1. See Section 4.4.1, 
“Invalid Operation Exception” on page 91. 


Exception 


Floating-Point Invalid Operation 
(Invalid Square Root) (VXSQRT) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


-—— Programming Note 


If the implementation does not support the 
optional Floating Square Root or Floating 
Reciprocal Square Root Estimate instruction, 
software can simulate the instruction and 
set this bit to reflect the exception. 











Floating-Point Invalid Operation 
(Invalid Integer Convert) (VXCVI) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


Floating-Point 
Enable (VE) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point Overflow Exception Enable (OE) 
See Section 4.4.3, “Overflow Exception” on 
page 93. 


Floating-Point Underflow Exception Enable (UE) 
See Section 4.4.4, “Underflow Exception” on 
page 93. 


Floating-Point Zero Divide Exception Enable 
(ZE) 

See Section 4.4.2, “Zero Divide Exception” on 
page 92. 


Floating-Point Inexact Exception Enable (XE) 
See Section 4.4.5, “Inexact Exception” on 
page 94. 


Floating-Point Non-IEEE Mode (NI) 
Floating-point non-IEEE mode is optional. If 
floating-point non-IEEE mode is not imple- 
mented, this bit is treated as reserved, and the 
remainder of the definition of this bit does not 
apply. 


If floating-point non-IEEE mode is implemented, 
this bit has the following meaning. 


0 The processor is not in_ floating-point 
non-IEEE mode (i.e., all floating-point oper- 
ations conform to the IEEE standard). 

1 The processor is in floating-point non-IEEE 
mode. 


When the processor is in floating-point 
non-IEEE mode, the remaining FPSCR bits 
may have meanings different from those 
given in this document, and floating-point 
operations need not conform to the IEEE 
standard. The effects of running with 
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FPSCRyj=1, and any additional require- 
ments for using non-lIEEE mode, are 
described in the Book IV, PowerPC Imple- 
mentation Features for the implementation, 
and may differ between implementations. 


-—— Programming Note 





When the processor is in floating-point 
non-IEEE mode, the results of floating- 
point operations may be approximate, 
and performance for these operations 
may be better, more predictable, or 
less data-dependent than when the 
processor is not in non-IEEE mode. For 
example, in non-IEEE mode an imple- 
mentation may return O instead of a 
denormalized number, and may return 
a large number instead of an infinity. 








30:31 Floating-Point Rounding Control (RN) 
See Section 4.3.6, “Rounding” on page 89. 


00 Round to Nearest 

01 Round toward Zero 

10 Round toward + Infinity 
11. Round toward - Infinity 


Result Value Class 


oO 
A 
" 
oe) 


Quiet NaN 

Infinity 

Normalized Number 
Denormalized Number 
Zero 

Zero 

Denormalized Number 
Normalized Number 
Infinity 


CODD OAAHAO 
|=-3 00000 0|V 
COO=-4-0000 
“=CD0D000O=-4 


1 
0 
0 
1 
1 
0 
1 
0 
0 


Figure 28. Floating-Point Result Flags 


4.3 Floating-Point Data 


4.3.1 Data Format 


This architecture defines the representation of a float- 
ing-point value in two different binary fixed-length 
formats. The format may be a 32-bit single format for 
a single-precision value or a 64-bit double format for 
a double-precision value. The single format may be 
used for data in storage. The double format format 
may be used for data in storage and for data in float- 
ing-point registers. 





The lengths of the exponent and the fraction fields 
differ between these two formats. The structure of 
the single and double formats is shown below. 


01 9 31 


Figure 29. Floating-point single format 


EXP FRACTION 
0 


1 12 63 


Figure 30. Floating-point double format 


Values in floating-point format are composed of three 
fields: 


S sign bit 
EXP exponent+bias 
FRACTION fraction 


Representation of numeric values in the floating-point 
formats consists of a sign bit (S), a biased exponent 
(EXP), and the fraction portion (FRACTION) of the 
significand. The significand consists of a leading 
implied bit concatenated on the right with the FRAC- 
TION. This leading implied bit is 1 for normalized 
numbers and O for denormalized numbers and is 
located in the unit bit position (i.e., the first bit to the 
left of the binary point). Values representable within 
the two floating-point formats can be specified by the 
parameters listed in Figure 31. 


Exponent Bias 
Maximum Exponent 
Minimum Exponent 


Widths (bits) 
Format 
Sign 
Exponent 
Fraction 
Significand 





Figure 31. IEEE floating-point fields 


The architecture requires that the FPRs of the 
Floating-Point Processor support the floating-point 
double format only. 
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4.3.2 Value Representation 


This architecture defines numeric and non-numeric 
values representable within each of the two supported 
formats. The numeric values are approximations to 
the real numbers and include the normalized 
numbers, denormalized numbers, and zero values. 
The non-numeric values representable are the infin- 
ities and the Not a Numbers (NaNs). The infinities are 
adjoined to the real numbers, but are not numbers 
themselves, and the standard rules of arithmetic do 
not hold when they are used in an operation. They 
are related to the real numbers by order alone. It is 
possible however to define restricted operations 
among numbers and infinities as defined below. The 
relative location on the real number line for each of 
the defined entities is shown in Figure 32. 





-INF | -NOR | -DEN | -0|+0 | +DEN +NOR |+INF 
<F a er > 


Figure 32. Approximation to real numbers 


The NaNs are not related to the numeric values or 
infinities by order or value but are encodings used to 
convey diagnostic information such as the represen- 
tation of uninitialized variables. 


The following is a description of the different floating- 
point values defined in the architecture: 


Binary floating-point numbers 

Machine representable values used as_ approxi- 
mations to real numbers. Three categories of 
numbers are supported: normalized numbers, denor- 
malized numbers, and zero values. 


Normalized numbers (+ NOR) 
These are values that have a biased exponent value 
in the range: 


1 to 254 in single format 
1 to 2046 in double format 


They are values in which the implied unit bit is 1. 
Normalized numbers are interpreted as follows: 


NOR = (—1)§ x 2E x (1.fraction) 


where s is the sign, E is the unbiased exponent, and 
1.fraction is the significand, which is composed of a 
leading unit bit (implied bit) and a fraction part. 


The ranges covered by the magnitude (M) of a nor- 
malized floating-point number are approximately 
equal to: 
Single Format: 
1.2x10-38 < M < 3.4x1038 


Double Format: 
2.2x10~ 308 < M < 1.8x10908 


Zero values (+ 0) 

These are values that have a biased exponent value 
of zero and a fraction value of zero. Zeros can have 
a positive or negative sign. The sign of zero is 
ignored by comparison operations (i.e., Comparison 
regards +0 as equal to —0). 


Denormalized numbers (+ DEN) 

These are values that have a biased exponent value 
of zero and a nonzero fraction value. They are 
nonzero numbers smaller in magnitude than the 
representable normalized numbers. They are values 
in which the implied unit bit is 0. Denormalized 
numbers are interpreted as follows: 


DEN = (—1)§ x 2E™In x (0.fraction) 


where Emin is the minimum representable exponent 
value (—126 for single-precision, — 1022 for double- 
precision). 


Infinities (+ ) 
These are values that have the maximum biased 
exponent value: 


255 in single format 
2047 in double format 


and a zero fraction value. They are used to approxi- 
mate values greater in magnitude than the maximum 
normalized value. 


Infinity arithmetic is defined as the limiting case of 
real arithmetic, with restricted operations defined 
among numbers and infinities. Infinities and the real 
numbers can be related by ordering in the affine 
sense: 


—oo < every finite number < +0 


Arithmetic on infinities is always exact and does not 
signal any exception, except when an exception 
occurs due to the invalid operations as described in 
Section 4.4.1, “Invalid Operation Exception” on 
page 91. 


Not a Numbers (NaNs) 

These are values that have the maximum biased 
exponent value and a nonzero fraction value. The 
sign bit is ignored (i.e., NaNs are neither positive nor 
negative). If the high-order bit of the fraction field is 
0 then the NaN is a Signaling NaN; otherwise it is a 
Quiet NaN. 


Signaling NaNs are used to signal exceptions when 
they appear as operands of computational 
instructions. 


Quiet NaNs are used to represent the results of 
certain invalid operations, such as invalid arithmetic 
operations on infinities or on NaNs, when Invalid 
Operation Exception is disabled (FPSCRyp=0). Quiet 
NaNs propagate through all floating-point operations 
except ordered comparison, Floating Round to Single- 
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Precision, and conversion to integer. Quiet NaNs do 
not signal exceptions, except for ordered comparison 
and conversion to integer operations. Specific 
encodings in QNaNs can thus be preserved through a 
sequence of floating-point operations, and used to 
convey diagnostic information to help identify results 
from invalid operations. 


When a QNaN is the result of a floating-point opera- 
tion because one of the operands is a NaN or because 
a QNaN was generated due to a disabled Invalid 
Operation Exception, then the following rule is applied 
to determine the NaN with the high-order fraction bit 
set to 1 that is to be stored as the result. 


if (FRA) is a NaN 
then FRT € (FRA) 
else if (FRB) is a NaN 
then if instruction is frsp 
then FRT ¢ (FRB)o-34 II 290 
else FRT € (FRB) 
else if (FRC) is a NaN 
then FRT € (FRC) 
else if generated QNaN 
then FRT € generated QNaN 


If the operand specified by FRA is a NaN, then that 
NaN is stored as the result. Otherwise, if the operand 
specified by FRB is a NaN (if the instruction specifies 
an FRB operand), then that NaN is stored as the 
result, with the low-order 29 bits of the result set to 0 
if the instruction is frsp. Otherwise, if the operand 
specified by FRC is a NaN (if the instruction specifies 
an FRC operand), then that NaN is stored as the 
result. Otherwise, if a QNaN was generated due to a 
disabled Invalid Operation Exception, then that QNaN 
is stored as the result. If a QNaN is to be generated 
as a result, then the QNaN generated has a sign bit of 
0, an exponent field of all 1s, and a high-order fraction 
bit of 1 with all other fraction bits 0. Any instruction 
that generates a QNaN as the result of a disabled 
Invalid Operation Exception generates this QNaN (i.e., 
0x7FF8_0000_0000_0000). 


A double-precision NaN is considered to be represent- 
able in single format if and only if the low-order 29 
bits of the double-precision NaN's fraction are zero. 


4.3.3 Sign of Result 


The following rules govern the sign of the result of an 
arithmetic, rounding, or conversion operation, when 
the operation does not yield an exception. They apply 
even when the operands or results are zeros or infin- 
ities. 
m= The sign of the result of an add operation is the 
sign of the operand having the larger absolute 
value. If both operands have the same sign, the 
sign of the result of an add operation is the same 
as the sign of the operands. The sign of the 


result of the subtract operation x—y is the same 
as the sign of the result of the add operation 
X+(—Y). 

When the sum of two operands with opposite 
sign, or the difference of two operands with the 
same sign, is exactly zero, the sign of the result 
is positive in all rounding modes except Round 
toward — Infinity, in which mode the sign is nega- 
tive. 


m The sign of the result of a multiply or divide oper- 
ation is the Exclusive OR of the signs of the oper- 
ands. 


= The sign of the result of a Square Root or Recip- 
rocal Square Root Estimate operation is always 
positive, except that the square root of —0 is —0 
and the reciprocal square root of —0 is — Infinity. 


m= The sign of the result of a Round to Single- 
Precision or Convert To/From Integer operation is 
the sign of the operand being converted. 


For the Multiply-Add instructions, the rules given 
above are applied first to the multiply operation and 
then to the add or subtract operation (one of the 
inputs to the add or subtract operation is the result of 
the multiply operation). 


4.3.4 Normalization and 
Denormalization 


The intermediate result of an arithmetic or frsp 
instruction may require normalization and/or denor- 
malization as described below. Normalization and 
denormalization do not affect the sign of the result. 


When an arithmetic or frsp instruction produces an 
intermediate result, consisting of a sign bit, an expo- 
nent, and a nonzero significand with a 0 leading bit, it 
is not a normalized number and must be normalized 
before it is stored. 


A number is normalized by shifting its significand left 
while decrementing its exponent by 1 for each bit 
shifted, until the leading significand bit becomes 1. 
The Guard bit and the Round bit (see Section 4.5.1, 
“Execution Model for IEEE Operations” on page 95) 
participate in the shift with zeros shifted into the 
Round bit. The exponent is regarded as if its range 
were unlimited. 


After normalization, or if normalization was not 
required, the intermediate result may have a nonzero 
significand and an exponent value that is less than 
the minimum value that can be represented in the 
format specified for the result. In this case, the inter- 
mediate result is said to be “Tiny” and the stored 
result is determined by the rules described in Section 
4.4.4, “Underflow Exception” on page 93. These rules 
may require denormailization. 
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A number is denormalized by shifting its significand 
right while incrementing its exponent by 1 for each bit 
shifted, until the exponent is equal to the format's 
minimum value. If any significant bits are lost in this 
shifting process then “Loss of Accuracy” has occurred 
(See Section 4.4.4, “Underflow Exception” on 
page 93) and Underflow Exception is signaled. 


4.3.5 Data Handling and Precision 


Instructions are defined to move floating-point data 
between the FPRs and storage. For double format 
data, the data are not altered during the move. For 
single format data, a format conversion from single to 
double is performed when loading from storage into 
an FPR and a format conversion from double to single 
is performed when storing from an FPR to storage. 
No floating-point exceptions are caused by these 
instructions. 


All computational, Move, and Select instructions use 
the floating-point double format. 


Floating-point single-precision is obtained with the 
implementation of four types of instruction. 


1. Load Floating-Point Single 


This form of instruction accesses a_ single- 
precision operand in single format in storage, 
converts it to double format, and loads it into an 
FPR. No floating-point exceptions are caused by 
these instructions. 


2. Round to Floating-Point Single-Precision 


The Floating Round to Single-Precision instruction 
rounds a double-precision operand to  single- 
precision, checking the exponent for single- 
precision range and handling any exceptions 
according to respective enable bits, and places 
that operand into an FPR as a double-precision 
operand. For results produced by single- 
precision arithmetic instructions, single-precision 
loads, and other instances of the Floating Round 
to Single-Precision instruction, this operation 
does not alter the value. 


3. Single-Precision Arithmetic Instructions 


This form of instruction takes operands from the 
FPRs in double format, performs the operation as 
if it produced an intermediate result having infi- 
nite precision and unbounded exponent range, 
and then coerces this intermediate result to fit in 
single format. Status bits, in the FPSCR and 
optionally in the Condition Register, are set to 
reflect the single-precision result. The result is 
then converted to double format and placed into 


an FPR. The result lies in the range supported by 
the single format. 


All input values must be representable in single 
format; if they are not, the result placed into the 
target FPR, and the setting of status bits in the 
FPSCR and in the Condition Register (if Rc=1), 
are undefined. 


4. Store Floating-Point Single 


This form of instruction converts a double- 
precision operand to single format and stores 
that operand into storage. No _ floating-point 
exceptions are caused by these _ instructions. 
(The value being stored is effectively assumed to 
be the result of an instruction of one of the pre- 
ceding three types.) 


When the result of a Load Floating-Point Single, 
Floating Round to Single-Precision, or single-precision 
arithmetic instruction is stored in an FPR, the low- 
order 29 FRACTION bits are zero. 





Programming Note 


The Floating Round to Single-Precision instruction 
is provided to allow value conversion from 
double-precision to single-precision with appro- 
priate exception checking and rounding. This 
instruction should be used to convert double- 
precision floating-point values (produced by 
double-precision load and arithmetic instructions 
and by fefid) to single-precision values prior to 
storing them into single format storage elements 
or using them as operands for single-precision 
arithmetic instructions. Values produced by 
single-precision load and arithmetic instructions 
are already single-precision values and can be 
stored directly into single format storage ele- 
ments, or used directly as operands for single- 
precision arithmetic instructions, without 
preceding the store, or the arithmetic instruction, 
by a Floating Round to Single-Precision instruc- 
tion. 











Programming Note 


A single-precision value can be used in double- 
precision arithmetic operations. The reverse is 
true only if the double-precision value is repre- 
sentable in single format. 


Some implementations may execute _ single- 
precision arithmetic instructions faster than 
double-precision arithmetic instructions. There- 


fore, if double-precision accuracy is not required, 
single-precision data and instructions should be 
used. 











88 PowerPC User Instruction Set Architecture 


Version 2.01 





4.3.6 Rounding 


The material in this section applies to operations that 
have numeric operands (i.e., operands that are not 
infinities or NaNs). Rounding the intermediate result 
of such an operation may cause an Overflow Excep- 
tion, an Underflow Exception, or an Inexact Exception. 
The remainder of this section assumes that the opera- 
tion causes no exceptions and that the result is 
numeric. See Section 4.3.2, “Value Representation” 
on page 86 and Section 4.4, “Floating-Point 
Exceptions” on page 89 for the cases not covered 
here. 


The arithmetic, rounding, and conversion instructions 
produce an intermediate result that can be regarded 
as having infinite precision and unbounded exponent 
range. This intermediate result is normalized or 
denormalized if required, then rounded to the destina- 
tion format. The final result is then placed into the 
target FPR in double format or in fixed-point integer 
format, depending on the instruction. 


The instructions that round their intermediate result 
are the Arithmetic and Rounding and Conversion 
instructions. Each of these instructions sets FPSCR 
bits FR and Fl. If the fraction was incremented during 
rounding then FR is set to 1, otherwise FR is set to 0. 
If the rounded result is inexact then FI is set to 1, oth- 
erwise FI is set to 0. 


The two Estimate instructions set FR and FI to unde- 
fined values. The remaining floating-point instructions 
do not alter FR and FI. 


Four user-selectable rounding modes are provided 
through the Floating-Point Rounding Control field in 
the FPSCR. See Section 4.2.2, “Floating-Point Status 
and Control Register” on page 83. These are 
encoded as follows: 


RN Rounding Mode 

00 Round to Nearest 

01 Round toward Zero 

10 Round toward + Infinity 
11 Round toward - Infinity 


Let Z be the intermediate arithmetic result or the 
operand of a convert operation. If Z can be repres- 
ented exactly in the target format, then the result in 
all rounding modes is Z as represented in the target 
format. If Z cannot be represented exactly in the 
target format, let Z1 and Z2 bound Z as the next 
larger and next smaller numbers representable in the 
target format. Then Z1 or Z2 can be used to approxi- 
mate the result in the target format. 


Figure 33 shows the relation of Z, Z1, and Z2 in this 
case. The following rules specify the rounding in the 
four modes. “LSB” means “least significant bit”. 


By Incrementing LSB of Z 
Infinitely Precise Value 
r— By Truncating after LSB —— 


vwvvyv vvyv 























Z2| Zl 0 Z2| Zl 
Z Z 
Negative values <—j—» Positive values 











Figure 33. Selection of Z1 and Z2 


Round to Nearest 
Choose the value that is closer to Z (Z1 or Z2). 
In case of a tie, choose the one that is even 
(least significant bit 0). 


Round toward Zero 
Choose the smaller in magnitude (Z1 or Z2). 


Round toward + Infinity 
Choose 21. 


Round toward — Infinity 
Choose 22. 


See Section 4.5.1, “Execution Model for IEEE 
Operations” on page 95 for a detailed explanation of 
rounding. 


4.4 Floating-Point Exceptions 


This architecture defines the following floating-point 
exceptions: 


a Invalid Operation Exception 
SNaN 
Infinity— Infinity 
Infinity= Infinity 
Zero+ Zero 
Infinityx Zero 
Invalid Compare 
Software Request 
Invalid Square Root 
Invalid Integer Convert 

a Zero Divide Exception 

= Overflow Exception 

a Underflow Exception 

m Inexact Exception 


These exceptions may occur during execution of com- 
putational instructions. In addition, an Invalid Opera- 
tion Exception occurs when a Move To FPSCR 
instruction sets FPSCRyysorr to 1 (Software Request). 


Each floating-point exception, and each category of 
Invalid Operation Exception, has an exception bit in 
the FPSCR. In addition, each floating-point exception 
has a corresponding enable bit in the FPSCR. The 
exception bit indicates occurrence of the corre- 
sponding exception. If an exception occurs, the corre- 
sponding enable bit governs the result produced by 
the instruction and, in conjunction with the FEO and 
FE1 bits (see page 90), whether and how the system 
floating-point enabled exception error handler is 
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invoked. (In general, the enabling specified by the 
enable bit is of invoking the system error handler, not 
of permitting the exception to occur. The occurrence 
of an exception depends only on the instruction and 
its inputs, not on the setting of any control bits. The 
only deviation from this general rule is that the occur- 
rence of an Underflow Exception may depend on the 
setting of the enable bit.) 


A single instruction, other than mtfsfi or mtfsf, may 
set more than one exception bit only in the following 
cases: 


m Inexact Exception may be set with Overflow 
Exception. 

m Inexact Exception may be set with Underflow 
Exception. 

a Invalid Operation Exception (SNaN) is set with 
Invalid Operation Exception (co x0) for 
Multiply-Add instructions for which the values 
being multiplied are infinity and zero and the 
value being added is an SNaN. 

= Invalid Operation Exception (SNaN) may be set 
with Invalid Operation Exception (Invalid 
Compare) for Compare Ordered instructions. 

a Invalid Operation Exception (SNaN) may be set 
with Invalid Operation Exception (Invalid Integer 
Convert) for Convert To Integer instructions. 


When an exception occurs the instruction execution 
may be suppressed or a result may be delivered, 
depending on the exception. 


Instruction execution is suppressed for the following 
kinds of exception, so that there is no possibility that 
one of the operands is lost: 


m Enabled Invalid Operation 
m Enabled Zero Divide 


For the remaining kinds of exception, a result is gen- 
erated and written to the destination specified by the 
instruction causing the exception. The result may be 
a different value for the enabled and disabled condi- 
tions for some of these exceptions. The kinds of 
exception that deliver a result are the following: 


Disabled Invalid Operation 
Disabled Zero Divide 
Disabled Overflow 
Disabled Underflow 
Disabled Inexact 

Enabled Overflow 

Enabled Underflow 
Enabled Inexact 


Subsequent sections define each of the floating-point 
exceptions and specify the action that is taken when 
they are detected. 


The IEEE standard specifies the handling of excep- 
tional conditions in terms of “traps” and “trap han- 
dlers”. In this architecture, an FPSCR exception 
enable bit of 1 causes generation of the result value 


specified in the IEEE standard for the “trap enabled” 
case; the expectation is that the exception will be 
detected by software, which will revise the result. An 
FPSCR exception enable bit of 0 causes generation of 
the “default result” value specified for the “trap disa- 
bled” (or “no trap occurs” or “trap is not imple- 
mented”) case; the expectation is that the exception 
will not be detected by software, which will simply use 
the default result. The result to be delivered in each 
case for each exception is described in the sections 
below. 


The IEEE default behavior when an exception occurs 
is to generate a default value and not to notify soft- 
ware. In this architecture, if the IEEE default behavior 
when an exception occurs is desired for all 
exceptions, all FPSCR exception enable bits should be 
set to 0 and Ignore Exceptions Mode (see below) 
should be used. In this case the system floating-point 
enabled exception error handler is not invoked, even 
if floating-point exceptions occur: software can inspect 
the FPSCR exception bits if necessary, to determine 
whether exceptions have occurred. 


In this architecture, if software is to be notified that a 
given kind of exception has occurred, the corre- 
sponding FPSCR exception enable bit must be set to 1 
and a mode other than Ignore Exceptions Mode must 
be used. In this case the system floating-point 
enabled exception error handler is invoked if an 
enabled floating-point exception occurs. The system 
floating-point enabled exception error handler is also 
invoked if a Move To FPSCR instruction causes an 
exception bit and the corresponding enable bit both to 
be 1; the Move To FPSCR instruction is considered to 
cause the enabled exception. 


The FEO and FE1 bits control whether and how the 
system floating-point enabled exception error handler 
is invoked if an enabled floating-point exception 
occurs. The location of these bits and the require- 
ments for altering them are described in Book Ill, 
PowerPC Operating Environment Architecture. (The 
system floating-point enabled exception error handler 
is never invoked because of a disabled floating-point 
exception.) The effects of the four possible settings of 
these bits are as follows. 


FEO FE1 Description 


0 O- Ignore Exceptions Mode 
Floating-point exceptions do not cause the 
system floating-point enabled exception 
error handler to be invoked. 


0 1  Imprecise Nonrecoverable Mode 
The system floating-point enabled exception 
error handler is invoked at some point at or 
beyond the instruction that caused the 
enabled exception. It may not be possible to 
identify the excepting instruction or the data 
that caused the exception. Results 
produced by the excepting instruction may 
have been used by or may have affected 
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subsequent instructions that are executed 
before the error handler is invoked. 


1 0 Imprecise Recoverable Mode 

The system floating-point enabled exception 
error handler is invoked at some point at or 
beyond the instruction that caused the 
enabled exception. Sufficient information is 
provided to the error handler that it can 
identify the excepting instruction and the 
operands, and correct the result. No results 
produced by the excepting instruction have 
been used by or have affected subsequent 
instructions that are executed before the 
error handler is invoked. 


1 1 Precise Mode 
The system floating-point enabled exception 
error handler is invoked precisely at the 
instruction that caused the enabled excep- 
tion. 


In all cases, the question of whether a floating-point 
result is stored, and what value is stored, is governed 
by the FPSCR exception enable bits, as described in 
subsequent sections, and is not affected by the value 
of the FEO and FE1 bits. 


In all cases in which the system floating-point enabled 
exception error handler is invoked, all instructions 
before the instruction at which the system floating- 
point enabled exception error handler is invoked have 
completed, and no instruction after the instruction at 
which the system floating-point enabled exception 
error handler is invoked has begun execution. (Recall 
that, for the two Imprecise modes, the instruction at 
which the system floating-point enabled exception 
error handler is invoked need not be the instruction 
that caused the exception.) The instruction at which 
the system floating-point enabled exception error 
handler is invoked has not been executed unless it is 
the excepting instruction, in which case it has been 
executed if the exception is not among those listed on 
page 90 as suppressed. 





Programming Note 


In any of the three non-Precise modes, a Floating- 
Point Status and Control Register instruction can 
be used to force any exceptions, due to 
instructions initiated before the Floating-Point 
Status and Control Register instruction, to be 
recorded in the FPSCR. (This forcing is super- 
fluous for Precise Mode.) 


In either of the Imprecise modes, a Floating-Point 
Status and Control! Register instruction can be 


In order to obtain the best performance across the 
widest range of implementations, the programmer 
should obey the following guidelines. 


If the IEEE default results are acceptable to the 
application, Ignore Exceptions Mode should be 
used with all FPSCR exception enable bits set to 
0. 


If the IEEE default results are not acceptable to 
the application, Imprecise Nonrecoverable Mode 
should be used, or Imprecise Recoverable Mode 
if recoverability is needed, with FPSCR exception 
enable bits set to 1 for those exceptions for which 
the system floating-point enabled exception error 
handler is to be invoked. 


Ignore Exceptions Mode should not, in general, be 
used when any FPSCR exception enable bits are 
set to 1. 


Precise Mode may degrade performance in some 
implementations, perhaps substantially, and 
therefore should be used only for debugging and 
other specialized applications. 


4.4.1 Invalid Operation Exception 


4.4.1.1 Definition 


An 


Invalid Operation Exception occurs when an 


operand is invalid for the specified operation. The 
invalid operations are: 


Any floating-point operation on a Signaling NaN 
(SNaN) 

For add or subtract operations, magnitude sub- 
traction of infinities (co — co ) 

Division of infinity by infinity (co + co ) 

Division of zero by zero (0+ 0) 

Multiplication of infinity by zero (co x0) 

Ordered comparison involving a NaN (Invalid 
Compare) 

Square root or reciprocal square root of a nega- 
tive (and nonzero) number (Invalid Square Root) 
Integer convert involving a number too large in 
magnitude to be represented in the target format, 
or involving an infinity or a NaN (Invalid Integer 
Convert) 


In addition, an Invalid Operation Exception occurs if 
software explicitly requests this by executing an 
mtfsfi, mtfsf, or mtfsb1 instruction that sets 
FPSCRyxsort to 1 (Software Request). 


Programming Note 


used to force any invocations of the system float- 
ing-point enabled exception error handler, due to 
instructions initiated before the Floating-Point 
Status and Control Register instruction, to occur. 
(This forcing has no effect in Ignore Exceptions 
Mode, and is superfluous for Precise Mode.) 








The purpose of FPSCRyxsoer is to allow software 
to cause an Invalid Operation Exception for a con- 
dition that is not necessarily associated with the 
execution of a floating-point instruction. For 
example, it might be set by a program that com- 
putes a square root, if the source operand is neg- 
ative. 
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4.4.1.2 Action 


The action to be taken depends on the setting of the 
Invalid Operation Exception Enable bit of the FPSCR. 


When Invalid Operation Exception is enabled 
(FPSCRye=1) and Invalid Operation occurs or soft- 
ware explicitly requests the exception, the following 
actions are taken: 


1. One or two Invalid Operation Exceptions are set 


FPSCRyxsnan (if SNaN) 
FPSCRyxis) (if 2 — 20 ) 
FPSCRyxip1 (if 2 +20 ) 

F CRyxzpz (if 0+ 0) 

FPSCRy xz (if co x0) 
FPSCRyxvc (if invalid comp) 
FPSCRyxsort (if software req) 
FPSCRyxsqrt (if invalid sqrt) 
FPSCRyxcv1 (if invalid int cvrt) 


2. If the operation is an arithmetic, Floating Round 
to Single-Precision, or convert to integer opera- 
tion, 

the target FPR is unchanged 
FPSCRrp fF) are set to zero 
FPSCRrppr is unchanged 
3. If the operation is a compare, 
FPSCRer Fc are unchanged 
FPSCRepcc is set to reflect unordered 
4. lf software explicitly requests the exception, 
FPSCRre fF) cpap are as set by the mitfsfi, 
mtfsf, or mtfsb1 instruction 


When Invalid Operation Exception is disabled 
(FPSCRye=0) and Invalid Operation occurs or soft- 
ware explicitly requests the exception, the following 
actions are taken: 


1. One or two Invalid Operation Exceptions are set 


FPSCRyxsnan (if SNaN) 
FPSCRyxis) (if 2 — ce ) 
FPSCRyxip1 (if ee +e ) 

F CRyxzpz (if 0+ 0) 
FPSCRyxiz (if co x0) 
FPSCRyxvc (if invalid comp) 
FPSCRyxsort (if software req) 
FPSCRyxsqrt (if invalid sqrt) 
FPSCRyxcv1 (if invalid int cvrt) 


2. If the operation is an arithmetic or Floating 
Round to Single-Precision operation, 
the target FPR is set to a Quiet NaN 
FPSCRrp fF) are set to zero 
FPSCRrppr is set to indicate the class of the 
result (Quiet NaN) 
3. If the operation is a convert to 64-bit integer 
operation, 
the target FPR is set as follows: 
FRT is set to the most positive 64-bit 
integer if the operand in FRB is a posi- 
tive number or + <, and to the most 


negative 64-bit integer if the operand in 
FRB is a negative number, — °, or NaN 
FPSCRrp fF) are set to zero 
FPSCRrppr¢ is undefined 
4. If the operation is a convert to 32-bit integer 
operation, 
the target FPR is set as follows: 
FRT 9.31 € undefined 
FRT39.63 are set to the most positive 
32-bit integer if the operand in FRB is a 
positive number or + c, and to the most 
negative 32-bit integer if the operand in 
FRB is a negative number, — , or NaN 
FPSCRrp fF) are set to zero 
FPSCRrppr¢ is undefined 
5. If the operation is a compare, 
FPSCRer Fc are unchanged 
FPSCRepcc is set to reflect unordered 
6. If software explicitly requests the exception, 
FPSCRee fF) cpap are as set by the mifsfi, 
mtfsf, or mtfsb71 instruction 


4.4.2 Zero Divide Exception 
4.4.2.1 Definition 


A Zero Divide Exception occurs when a Divide instruc- 
tion is executed with a zero divisor value and a finite 
nonzero dividend value. It also occurs when a Recip- 
rocal Estimate instruction (fres or frsqrte) is executed 
with an operand value of zero. 


4.4.2.2 Action 


The action to be taken depends on the setting of the 
Zero Divide Exception Enable bit of the FPSCR. 


When Zero Divide Exception is enabled (FPSCRz-=1 ) 
and Zero Divide occurs, the following actions are 
taken: 


1. Zero Divide Exception is set 
FPSCRzy ¢ 1 

2. The target FPR is unchanged 

3. FPSCRep fF; are set to zero 

4. FPSCReppr is unchanged 


When Zero Divide Exception is disabled (FPSCRz-=0) 
and Zero Divide occurs, the following actions are 
taken: 


1. Zero Divide Exception is set 
FPSCRzy ¢ 1 

2. The target FPR is set to +Infinity, where the sign 
is determined by the XOR of the signs of the 
operands 

3. FPSCRep rf; are set to zero 

4. FPSCReppr is set to indicate the class and sign of 
the result (+ Infinity) 
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4.4.3 Overflow Exception 
4.4.3.1 Definition 


Overflow occurs when the magnitude of what would 
have been the rounded result if the exponent range 
were unbounded exceeds that of the largest finite 
number of the specified result precision. 


4.4.3.2 Action 


The action to be taken depends on the setting of the 
Overflow Exception Enable bit of the FPSCR. 


When Overflow Exception is enabled (FPSCRo¢=1) 
and exponent overflow occurs, the following actions 
are taken: 


1. Overflow Exception is set 
FPSCRox € 1 

2. For double-precision arithmetic instructions, the 
exponent of the normalized intermediate result is 
adjusted by subtracting 1536 

3. For single-precision arithmetic instructions and 
the Floating Round to Single-Precision instruc- 
tion, the exponent of the normalized intermediate 
result is adjusted by subtracting 192 

4. The adjusted rounded result is placed into the 
target FPR 

5. FPSCReppr is set to indicate the class and sign of 
the result (+ Normal Number) 


When Overflow Exception is disabled (FPSCRo¢=0) 
and overflow occurs, the following actions are taken: 


1. Overflow Exception is set 
FPSCRox € 1 
2. Inexact Exception is set 
FPSCRxx € 1 
3. The result is determined by the rounding mode 
(FPSCRen) and the sign of the intermediate result 
as follows: 
A. Round to Nearest 
Store + Infinity, where the sign is the sign of 
the intermediate result 
B. Round toward Zero 
Store the format's largest finite number with 
the sign of the intermediate result 
C. Round toward + Infinity 
For negative overflow, store the format's 
most negative finite number; for positive 
overflow, store + Infinity 
D. Round toward — Infinity 
For negative overflow, store —Infinity; for 
positive overflow, store the format's largest 
finite number 
The result is placed into the target FPR 
FPSCRrp is undefined 
FPSCRr, is set to 1 
FPSCRrppr is set to indicate the class and sign of 
the result (+ Infinity or + Normal Number) 


SEO Ol 


4.4.4 Underflow Exception 
4.4.4.1 Definition 


Underflow Exception is defined separately for the 
enabled and disabled states: 


m Enabled: 
Underflow occurs when the intermediate result is 
“Tiny”. 

a Disabled: 
Underflow occurs when the intermediate result is 
“Tiny” and there is “Loss of Accuracy”. 


A “Tiny” result is detected before rounding, when a 
nonzero intermediate result computed as though both 
the precision and the exponent range were 
unbounded would be less in magnitude than the 
smallest normalized number. 


If the intermediate result is “Tiny” and Underflow 
Exception is disabled (FPSCRyg=0) then the interme- 
diate result is denormalized (see Section 4.3.4, “Nor- 
malization and Denormalization” on page 87) and 
rounded (see Section 4.3.6, “Rounding” on page 89) 
before being placed into the target FPR. 


“Loss of Accuracy” is detected when the delivered 
result value differs from what would have been com- 
puted were both the precision and the exponent range 
unbounded. 


4.4.4.2 Action 


The action to be taken depends on the setting of the 
Underflow Exception Enable bit of the FPSCR. 


When Underflow Exception is enabled (FPSCRyg=1 ) 
and exponent underflow occurs, the following actions 
are taken: 


1. Underflow Exception is set 
FPSCRyx € 1 

2. For double-precision arithmetic instructions, the 
exponent of the normalized intermediate result is 
adjusted by adding 1536 

3. For single-precision arithmetic instructions and 
the Floating Round to Single-Precision instruc- 
tion, the exponent of the normalized intermediate 
result is adjusted by adding 192 

4. The adjusted rounded result is placed into the 
target FPR 

5. FPSCReppr is set to indicate the class and sign of 
the result (+ Normalized Number) 
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Programming Note 


The FR and FI bits are provided to allow the 
system floating-point enabled exception error 
handler, when invoked because of an Underflow 
Exception, to simulate a “trap disabled” environ- 
ment. That is, the FR and FI bits allow the system 
floating-point enabled exception error handler to 
unround the result, thus allowing the result to be 
denormalized. 








When Underflow Exception is disabled (FPSCRyg=0) 
and underflow occurs, the following actions are taken: 


1. Underflow Exception is set 
FPSCRyx ¢ 1 
2. The rounded result is placed into the target FPR 
3. FPSCReppr is set to indicate the class and sign of 
the result (t Normalized Number, + Denormalized 
Number, or + Zero) 


4.4.5 Inexact Exception 
4.4.5.1 Definition 


An Inexact Exception occurs when one of two condi- 
tions occur during rounding: 


1. The rounded result differs from the intermediate 
result assuming both the precision and the expo- 
nent range of the intermediate result to be 
unbounded. In this case the result is said to be 
inexact. (If the rounding causes an enabled Over- 
flow Exception or an enabled Underflow Excep- 
tion, an Inexact Exception also occurs only if the 
significands of the rounded result and the inter- 
mediate result differ.) 

2. The rounded result overflows and Overflow 
Exception is disabled. 


4.4.5.2 Action 


The action to be taken does not depend on the setting 
of the Inexact Exception Enable bit of the FPSCR. 


When Inexact Exception occurs, the following actions 
are taken: 


1. Inexact Exception is set 
FPSCRxx € 1 
2. The rounded or overflowed result is placed into 
the target FPR 
3. FPSCReppr is set to indicate the class and sign of 
the result 


Programming Note 





In| some implementations, Inexact 


enabling 
Exceptions may degrade performance more than 
does enabling other types of floating-point excep- 
tion. 





4.5 Floating-Point Execution 
Models 


All implementations of this architecture must provide 
the equivalent of the following execution models to 
ensure that identical results are obtained. 


Special rules are provided in the definition of the 
computational instructions for the infinities, denormal- 
ized numbers and NaNs. The material in the 
remainder of this section applies to instructions that 
have numeric operands and a numeric result (i.e., 
operands and result that are not infinities or NaNs), 
and that cause no exceptions. See Section 4.3.2, 
“Value Representation” on page 86 and Section 4.4, 
“Floating-Point Exceptions” on page 89 for the cases 
not covered here. 


Although the double format specifies an 11-bit expo- 
nent, exponent arithmetic makes use of two additional 
bits to avoid potential transient overflow conditions. 
One extra bit is required when denormalized double- 
precision numbers are prenormalized. The second bit 
is required to permit the computation of the adjusted 
exponent value in the following cases when the corre- 
sponding exception enable bit is 1: 


mu Underflow during multiplication using a denormal- 
ized operand. 

= Overflow during division using a denormalized 
divisor. 


The IEEE standard includes 32-bit and 64-bit arith- 
metic. The standard requires that single-precision 
arithmetic be provided for single-precision operands. 
The standard permits double-precision floating-point 
operations to have either (or both) single-precision or 
double-precision operands, but states that single- 
precision floating-point operations should not accept 
double-precision operands. The PowerPC Architecture 
follows these guidelines; double-precision arithmetic 
instructions can have operands of either or both pre- 
cisions, while single-precision arithmetic instructions 
require all operands to be single-precision. Double- 
precision arithmetic instructions and fefid produce 
double-precision values, while single-precision arith- 
metic instructions produce single-precision values. 


For arithmetic instructions, conversions from double- 
precision to single-precision must be done explicitly 
by software, while conversions from single-precision 
to double-precision are done implicitly. 
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4.5.1 Execution Model for IEEE 
Operations 


The following description uses 64-bit arithmetic as an 
example. 32-bit arithmetic is similar except that the 
FRACTION is a 23-bit field, and the single-precision 
Guard, Round, and Sticky bits (described in this 
section) are logically adjacent to the 23-bit FRACTION 
field. 


IEEE-conforming significand arithmetic is considered 
to be performed with a floating-point accumulator 
having the following format, where bits 0:55 comprise 
the significand of the intermediate result. 


S}C]L FRACTION G| RIX 
01 52 55 


Figure 34. IEEE 64-bit execution model 
The S bit is the sign bit. 


The C bit is the carry bit, which captures the carry out 
of the significand. 


The L bit is the leading unit bit of the significand, 
which receives the implicit bit from the operand. 


The FRACTION is a 52-bit field that accepts the frac- 
tion of the operand. 


The Guard (G), Round (R), and Sticky (X) bits are 
extensions to the low-order bits of the accumulator. 
The G and R bits are required for postnormalization of 
the result. The G, R, and X bits are required during 
rounding to determine if the intermediate result is 
equally near the two nearest representable values. 
The X bit serves as an extension to the G and R bits 
by representing the logical OR of all bits that may 
appear to the low-order side of the R bit, due either to 
shifting the accumulator right or to other generation 
of low-order result bits. The G and R bits participate 
in the left shifts with zeros being shifted into the R bit. 
Figure 35 shows the significance of the G, R, and X 
bits with respect to the intermediate result (IR), the 
representable number next lower in magnitude (NL), 
and the representable number next higher in magni- 
tude (NH). 


Interpretation 


ooo, meet OSCSC* 


IR closer to NL 


IR midway between NL and NH 


IR closer to NH 





Figure 35. Interpretation of G, R, and X bits 


After normalization, the intermediate result is 
rounded, using the rounding mode specified by 
FPSCRpn. If rounding results in a carry into C, the 
significand is shifted right one position and the expo- 
nent incremented by one. This yields an inexact 
result and possibly also exponent overflow. Fraction 
bits to the left of the bit position used for rounding 
are stored into the FPR and low-order bit positions, if 
any, are set to zero. 


Four user-selectable rounding modes are provided 
through FPSCRey as described in Section 4.3.6, 
“Rounding” on page 89. For rounding, the conceptual 
Guard, Round, and Sticky bits are defined in terms of 
accumulator bits. Figure 36 shows the positions of 
the Guard, Round, and Sticky bits for double-precision 
and single-precision floating-point numbers in the 
IEEE execution model. 


[Guard [Round [Scky 


Double |G bit R bit 
Single 24 25 





X bit 
OR of 26:52, G, R, X 


Figure 36. Location of the Guard, Round, and Sticky 
bits in the IEEE execution model 


Rounding can be treated as though the significand 
were shifted right, if required, until the least signif- 
icant bit to be retained is in the low-order bit position 
of the FRACTION. If any of the Guard, Round, or 
Sticky bits is nonzero, then the result is inexact. 


Z1 and Z2, as defined on page 89, can be used to 
approximate the result in the target format when one 
of the following rules is used. 


= Round to Nearest 


Guard bit = 0 
The result is truncated. (Result exact (GRX = 
000) or closest to next lower value in magni- 
tude (GRX = 001, 010, or 011)) 


Guard bit = 1 
Depends on Round and Sticky bits: 


Case a 
If the Round or Sticky bit is 1 (inclusive), 
the result is incremented. (Result closest 
to next higher value in magnitude (GRX 
= 101, 110, or 111)) 


Case b 

If the Round and Sticky bits are 0 (result 
midway between closest representable 
values), then if the low-order bit of the 
result is 1 the result is incremented. 
Otherwise (the low-order bit of the result 
is 0) the result is truncated (this is the 
case of a tie rounded to even). 


= Round toward Zero 
Choose the smaller in magnitude of Z1 or Z2. If 
the Guard, Round, or Sticky bit is nonzero, the 
result is inexact. 
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= Round toward + Infinity 
Choose 21. 


= Round toward -— Infinity 
Choose 22. 


Where the result is to have fewer than 53 bits of pre- 
cision because the instruction is a Floating Round to 
Single-Precision or single-precision arithmetic instruc- 
tion, the intermediate result is either normalized or 
placed in correct denormalized form before being 
rounded. 


4.5.2 Execution Model for 
Multiply-Add Type Instructions 


The PowerPC Architecture provides a special form of 
instruction that performs up to three operations in 
one instruction (a multiplication, an addition, and a 
negation). With this added capability comes the 
special ability to produce a more exact intermediate 
result as input to the rounder. 32-bit arithmetic is 
similar except that the FRACTION field is smaller. 


Multiply-add significand arithmetic is considered to be 
performed with a floating-point accumulator having 
the following format, where bits 0:106 comprise the 
significand of the intermediate result. 


S}C]L FRACTION X' 
01 105 106 


Figure 37. Multiply-add 64-bit execution model 


The first part of the operation is a multiplication. The 
multiplication has two 53-bit significands as inputs, 
which are assumed to be _ prenormalized, and 
produces a result conforming to the above model. If 
there is a carry out of the significand (into the C bit), 
then the significand is shifted right one position, 
shifting the L bit (leading unit bit) into the most signif- 


icant bit of the FRACTION and shifting the C bit (carry 
out) into the L bit. All 106 bits (L bit, the FRACTION) 
of the product take part in the add operation. If the 
exponents of the two inputs to the adder are not 
equal, the significand of the operand with the smaller 
exponent is aligned (shifted) to the right by an amount 
that is added to that exponent to make it equal to the 
other input's exponent. Zeros are shifted into the left 
of the significand as it is aligned and bits shifted out 
of bit 105 of the significand are ORed into the X' bit. 
The add operation also produces a result conforming 
to the above model with the X' bit taking part in the 
add operation. 


The result of the addition is then normalized, with all 
bits of the addition result, except the X' bit, partic- 
ipating in the shift. The normalized result serves as 
the intermediate result that is input to the rounder. 


For rounding, the conceptual Guard, Round, and 
Sticky bits are defined in terms of accumulator bits. 
Figure 38 shows the positions of the Guard, Round, 
and Sticky bits for double-precision and _ single- 
precision floating-point numbers in the multiply-add 
execution model. 


Format [Guerd [Round [Sto 





Double |53 54 OR of 55:105, X' 
Single | 24 25 OR of 26:105, X' 


Figure 38. Location of the Guard, Round, and Sticky 
bits in the multiply-add execution model 


The rules for rounding the intermediate result are the 
same as those given in Section 4.5.1, “Execution 
Model for IEEE Operations” on page 95. 


If the instruction is Floating Negative Multiply-Add or 
Floating Negative Multiply-Subtract, the final result is 
negated. 
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4.6 Floating-Point Processor Instructions 


4.6.1 Floating-Point Storage Access Instructions 


The Storage Access instructions compute the effective 
address (EA) of the storage to be accessed as 
described in Section 1.12.2, “Effective Address 
Calculation” on page 14. 








Programming Note 


The la extended mnemonic permits computing an 
effective address as a Load or Store instruction 
would, but loads the address itself into a GPR 
rather than loading the value that is in storage at 
that address. This extended mnemonic is 
described in Section B.9, “Miscellaneous 
Mnemonics” on page 153. 








4.6.2 Floating-Point Load Instructions 


There are two basic forms of load instruction: single- 
precision and double-precision. Because the FPRs 
support only floating-point double format, single- 
precision Load Floating-Point instructions convert 
single-precision data to double format prior to loading 
the operand into the target FPR. The conversion and 
loading steps are as follows. 


Let WORD -3; be the floating-point single-precision 
operand accessed from storage. 


Normalized Operand 
if WORD,.. > 0 and WORD,.. < 255 then 
FRT.; ¢ WORD,.; 
FRT, ¢ ~WORD, 
FRT; ¢ -WORD, 
FRT, ¢ ~WORD, 
FRTs.¢3 € WORD».3, I! 290 


Denormalized Operand 
if WORD,., = 0 and WORD,.3; # 0 then 
sign ¢ WORD) 
exp € -— 126 
fraco.so ¢€ 0b0 || WORD,:31 Il 299 
normalize the operand 
do while fracg = 0 
frac ¢ fracy.5o || ObO 
exp € exp — 1 
FRT) € sign 
FRT 4-44 ¢ exp + 1023 
FRTy2.63 € fracy.so 


4.6.1.1 Storage Access Exceptions 


Storage accesses will cause the system data storage 
error handler to be invoked if the program is not 
allowed to modify the target storage (Store only), or if 
the program attempts to access storage that is una- 
vailable. 


Zero / Infinity / NaN 
if WORD,., = 255 or WORD,.3; = 0 then 

FRTo-; € WORDp».; 

FRT> € WORD, 

FRT3 ¢€ WORD, 

FRT, € WORD, 

FRT5.¢3 € WORD».3, I 290 
For double-precision Load Floating-Point instructions 
no conversion is required, as the data from storage 
are copied directly into the FPR. 


Many of the Load Floating-Point instructions have an 
“update” form, in which register RA is updated with 
the effective address. For these forms, if RA#0, the 
effective address is placed into register RA and the 
storage element (word or doubleword) addressed by 
EA is loaded into FRT. 


Note: Recall that RA and RB denote General Purpose 
Registers, while FRT denotes a Floating-Point Reg- 
ister. 
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Load Floating-Point Single D-form 


fs FRT,D(RA) 


if RA = 0 then ai 0 
else € (RA) 
FA ¢b+ aioe 

FRT ¢ DOUBLE (MEM(EA, 4)) 


Let the effective address (EA) be the sum (RA|0)+D 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 97) and placed into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Single with Update 
D-form 


Ifsu FRT,D(RA) 


EA ¢ (RA) + EXTS(D) 
FRT € DOUBLE (MEM(EA, 4)) 
RA €¢ EA 


Let the effective address (EA) be the sum (RA)+D. 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 97) and placed into register FRT. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Floating-Point Single Indexed 
X-form 


Ifsx FRT,RA,RB 


31 FRT RA RB 535 / 
0 6 11 16 21 31 


if RA = 0 then oF 0 
else € (RA) 
EA ¢b + (RB) 

FRT ¢ DOUBLE (MEM(EA, 4)) 


Let the effective address (EA) be the sum 
(RA|0O)+(RB). 

The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 97) and placed into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Single with Update 
Indexed X-form 


Ifsux FRT,RA,RB 


31 FRT RA RB 567 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
FRT € DOUBLE (MEM(EA, 4)) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 97) and placed into register FRT. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Floating-Point Double D-form 


lfd FRT,D(RA) 


if RA = 0 then b €0 

else b € (RA) 

EA ¢ b + EXTS(D) 

FRT € MEM(EA, 8) 

Let the effective address (EA) be the sum (RA|0)+D 


The doubleword in storage addressed by EA is placed 
into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Double with Update 
D-form 


Ifdu FRT,D(RA) 


EA € (RA) + EXTS(D) 

FRT € MEM(EA, 8) 

RA € EA 

Let the effective address (EA) be the sum (RA)+D. 


The doubleword in storage addressed by EA is placed 
into register FRT. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Floating-Point Double Indexed 
X-form 


lfdx FRT,RA,RB 


31 FRT RA RB 599 / 
0 6 11 16 21 31 


if RA = 0 then oe 0 
else ¢€ (RA) 
EA ¢b + (RB) 

FRT ¢ MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RA|0O)+(RB). 


The doubleword in storage addressed by EA is placed 
into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Double with Update 
Indexed X-form 


Ifdux FRT,RA,RB 


31 FRT RA RB 631 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

FRT € MEM(EA, 8) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 


The doubleword in storage addressed by EA is placed 
into register FRT. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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4.6.3 Floating-Point Store Instructions 


There are three basic forms of store instruction: 
single-precision, double-precision, and integer. The 
integer form is provided by the Store Floating-Point as 
Integer Word instruction, described on page 103. 
Because the FPRs support only floating-point double 
format for floating-point data, single-precision Store 
Floating-Point instructions convert double-precision 
data to single format prior to storing the operand into 
storage. The conversion steps are as follows. 


Let WORDp:3; be the word in storage written to. 


No Denormalization Required (includes Zero / Infinity 
/ NaN) 
if FRS4.44 > 896 or FRS4.63 = Othen 

WORD,.; € FRSp-; 

WORD».3; € FRS5.34 


Denormalization Required 
if 874 < FRS,.,4; < 896 then 
sign ¢ FRSo 
exp € FRS,.,, — 1023 
frac € Ob1 Il FRSy5.63 
denormalize operand 
do while exp < — 126 
frac € 0b0 II fracg.go 
exp € exp + 1 
WORD) ¢ sign 
WORD,.g € 0x00 
WORD .3; € fracy.o3 
else WORD ¢ undefined 


Notice that if the value to be stored by a single- 
precision Store Floating-Point instruction is larger in 
magnitude than the maximum number representable 
in single format, the first case above (No Denormal- 
ization Required) applies. The result stored in WORD 
is then a well-defined value, but is not numerically 
equal to the value in the source register (i.e., the 
result of a single-precision Load Floating-Point from 
WORD will not compare equal to the contents of the 
Original source register). 


For double-precision Store Floating-Point instructions 
and for the Store Floating-Point as Integer Word 
instruction no conversion is required, as the data 
from the FPR are copied directly into storage. 


Many of the Store Floating-Point instructions have an 
“update” form, in which register RA is updated with 
the effective address. For these forms, if RA#0, the 
effective address is placed into register RA. 


Note: Recall that RA and RB denote General Purpose 
Registers, while FRS denotes a Floating-Point Reg- 
ister. 
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Store Floating-Point Single D-form 


stfs FRS,D(RA) 
FRS 
31 
if RA = 0 then ee 0 
else € (RA) 


EA ¢b + eenein; 
MEM(EA, 4)  SINGLE((FRS) ) 


Let the effective address (EA) be the sum (RA|0)+D 
The contents of register FRS are converted to single 
format (see page 100) and stored into the word in 


storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Single with Update 
D-form 


stfsu FRS,D(RA) 
FRS 
31 
EA ¢ (RA) + EXTS(D) 
MEM(EA, 4) € SINGLE ((FRS) ) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+D. 
The contents of register FRS are converted to single 
format (see page 100) and stored into the word in 
storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Floating-Point Single Indexed 
X-form 


stfsx FRS,RA,RB 

31 FRS RA RB 663 / 
0 6 11 16 21 31 
if RA = 0 then oe 0 
else € (RA) 
EA ¢b + (RB) 


MEM(EA, 4) € SINGLE((FRS) ) 


Let the effective 
(RA|0O)+(RB). 


address (EA) be the sum 


The contents of register FRS are converted to single 
format (see page 100) and stored into the word in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Single with Update 
Indexed X-form 


stfsux FRS,RA,RB 

31 FRS RA RB 695 / 
0 6 11 16 21 31 
EA ¢ (RA) + (RB) 
MEM(EA, 4) € SINGLE ((FRS) ) 
RA ¢ EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The contents of register FRS are converted to single 
format (see page 100) and stored into the word in 
storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point Double D-form 


stfd FRS,D(RA) 


FRS 
31 


if RA = 0 then ei 0 

else € (RA) 

EA ¢b + Hae 

MEM(EA, 8) ¢ (FRS) 

Let the effective address (EA) be the sum (RA|0)+D 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Double with Update 
D-form 


stfdu FRS,D(RA) 


FRS 
31 


EA € (RA) + EXTS(D) 

MEM(EA, 8) ¢€ (FRS) 

RA ¢ EA 

Let the effective address (EA) be the sum (RA)+D. 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point Double Indexed 
X-form 


stfdx FRS,RA,RB 


31 FRS RA RB 727 / 
0 6 11 16 21 31 


if RA = 0 then or 0 
else € (RA) 
EA ¢b + (RB) 

MEM(EA, 8) ¢ (FRS) 


Let the effective address (EA) be the sum 
(RA|0)+(RB). 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Double with Update 
Indexed X-form 


stfdux FRS,RA,RB 


31 FRS RA RB 759 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

MEM(EA, 8) ¢ (FRS) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point as Integer Word 
Indexed X-form 


stfiwx FRS,RA,RB 


31 FRS RA RB 983 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b ¢€ ( 
EA ¢€b + (RB) 

MEM(EA, 4) € (FRS) 30.63 


RA) 


Let the effective address (EA) be the sum 
(RA|O)+(RB). 


The contents of the low-order 32 bits of register FRS 
are stored, without conversion, into the word in 
storage addressed by EA. 


If the contents of register FRS were produced, either 
directly or indirectly, by a Load Floating-Point Single 
instruction, a single-precision Arithmetic instruction, 
or frsp, then the value stored is undefined. (The con- 
tents of register FRS are produced directly by such an 
instruction if FRS is the target register for the instruc- 
tion. The contents of register FRS are produced indi- 
rectly by such an instruction if FRS is the final target 
register of a sequence of one or more Floating-Point 
Move instructions, with the input to the sequence 
having been produced directly by such an instruction.) 


Special Registers Altered: 
None 
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4.6.4 Floating-Point Move Instructions 


These instructions copy data from one floating-point 
register to another, altering the sign bit (bit 0) as 
described below for fneg, fabs, and fnabs. These 
instructions treat NaNs just like any other kind of 


Floating Move Register X-form 


fmr FRT,FRB (Rc=0) 
fmr. FRT,FRB (Ro=1) 


63 FRT //I FRB 72 Re 
0 6 1 16 21 31 


The contents of register FRB are placed into register 
FRT. 


Special Registers Altered: 
CR1 (if Re=1) 


Floating Absolute Value X-form 


fabs FRT,FRB (Rc=0) 
fabs. FRT,FRB (Rc=1) 

63 FRT /// FRB 264 Re 
0 6 11 16 21 31 


The contents of register FRB with bit 0 set to zero are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Re=1) 


value (e.g., the sign bit of a NaN may be altered by 
fneg, fabs, and fnabs). These instructions do not alter 
the FPSCR. 


Floating Negate X-form 


fneg FRT,FRB 
fneg. FRT,FRB 


63 FRT //1 FRB 40 Re 
0 6 11 16 21 31 


The contents of register FRB with bit 0 inverted are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Re=1) 


Floating Negative Absolute Value 
X-form 


fnabs FRT,FRB (Rc=0) 
fnabs. FRT,FRB (Rc=1) 


63 FRT HI FRB 136 Re 
0 6 11 16 21 31 


The contents of register FRB with bit 0 set to one are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Re=1) 
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4.6.5 Floating-Point Arithmetic Instructions 


4.6.5.1 Floating-Point Elementary Arithmetic Instructions 


Floating Add [Single] A-form 


fadd FRT,FRA,FRB 
fadd. FRT,FRA,FRB 


[POWER mnemonics: fa, fa.] 


63 FRT FRA FRB //1 21 |Re 
0 6 14 16 21 26 31 


fadds 
fadds. 


FRT,FRA,FRB 
FRT,FRA,FRB 


59 FRT FRA FRB //1 21 |Re 
0 6 11 16 21 26 31 


The floating-point operand in register FRA is added to 
the floating-point operand in register FRB. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point addition is based on exponent compar- 
ison and addition of the two significands. The expo- 
nents of the two operands are compared, and the 
significand accompanying the smaller exponent is 
shifted right, with its exponent increased by one for 
each bit shifted, until the two exponents are equal. 
The two significands are then added or subtracted as 
appropriate, depending on the signs of the operands, 
to form an intermediate sum. All 53 bits of the 
significand as well as all three guard bits (G, R, and 
X) enter into the computation. 


If a carry occurs, the sum's significand is shifted right 
one bit position and the exponent is increased by one. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX OX UX XX 

VXSNAN VXISI 

CR1 (if Re=1) 


Floating Subtract [Single] A-form 


fsub FRT,FRA,FRB 
fsub. FRT,FRA,FRB 


[POWER mnemonics: fs, fs.] 


63 FRT FRA FRB II 20 |Rc 
0 6 11 16 21 26 31 


fsubs FRT,FRA,FRB (Re=0) 
fsubs. FRT,FRA,FRB (Re=1) 

59 FRT | FRA | FRB | /// | 20 |Re 
0 6 11 16 21 26 31 


The floating-point operand in register FRB is sub- 
tracted from the floating-point operand in register 
FRA. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


The execution of the Floating Subtract instruction is 
identical to that of Floating Add, except that the con- 
tents of FRB participate in the operation with the sign 
bit (bit 0) inverted. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX OX UX XX 

VXSNAN VXISI 

CR1 (if Re=1) 
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Floating Multiply [Single] A-form 


fmul FRT,FRA,ERC (Rc=0) 
fmul. FRT,FRA,ERC (Rc=1) 


[POWER mnemonics: fm, fm.] 


63 FRT FRA //f FRC 25 |Rc 
0 6 11 16 21 26 31 


fmuls FRT,FRA,FRC = 
fmuls. FRT,FRA,FRC (Rc=1) 


59 FRT FRA //I FRC 25 |Rc 
0 6 14 16 21 26 31 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point multiplication is based on exponent 
addition and multiplication of the significands. 


FPSCReppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX OX UX XX 

VXSNAN_ VXIMZ 

CR1 (if Re=1) 


Floating Divide [Single] A-form 


fdiv FRT,FRA,FRB = 
fdiv. FRT,FRA,FRB (Rc=1) 


[POWER mnemonics: fd, fd.] 


63 FRT FRA FRB //I 18 |Re 
0 6 14 16 21 26 31 


fdivs FRT,FRA,FRB = 
fdivs. FRT,FRA,FRB (Rc=1) 


59 FRT FRA FRB //I 18 |Re 
0 6 14 16 21 26 31 


The floating-point operand in register FRA is divided 
by the floating-point operand in register FRB. The 
remainder is not supplied as a result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point division is based on exponent sub- 
traction and division of the significands. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCR7¢=1. 


Special Registers Altered: 
FPRF FR FI 
FX OX UX ZX XX 
VXSNAN VXIDI VXZDZ 
CR1 (if Re=1) 
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4.6.5.2 Floating-Point Multiply-Add Instructions 


These instructions combine a multiply and an add 
operation without an intermediate rounding operation. 
The fraction part of the intermediate product is 106 
bits wide (L bit, FRACTION), and all 106 bits take part 
in the add/subtract portion of the instruction. 


Status bits are set as follows. 


= Overflow, Underflow, and Inexact Exception bits, 
the FR and FI bits, and the FPRF field are set 


Floating Multiply-Add | Single] A-form 


fmadd FRT,FRA,FRC,FRB (Rc=0) 
fmadd. FRT,FRA,FRC,FRB (Rc=1) 
[POWER mnemonics: fma, fma.] 

63 FRT FRA FRB FRC 29 |Rc 
0 6 11 16 21 26 31 
fmadds FRT,FRA,FRC,FRB (Rc=0) 
fmadds. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 29 |Rc 
0 6 14 16 21 26 31 


The operation 
FRT ¢ [(FRA)x(FRC)] + (FRB) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is added to 
this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyep= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX OX UX XX 

VXSNAN_ VXISI VXIMZ 

CR1 (if Re=1) 


based on the final result of the operation, and not 
on the result of the multiplication. 


a Invalid Operation Exception bits are set as if the 
multiplication and the addition were performed 
using two separate instructions (fmull[ s], followed 
by fadd[s] or fsub[s]). That is, multiplication of 
infinity by O or of anything by an SNaN, and/or 
addition of an SNaN, cause the corresponding 
exception bits to be set. 


Floating Multiply-Subtract [ Single] 
A-form 


FRT,FRA,FRC,FRB 
FRT,FRA,FRC,FRB 


[POWER mnemonics: fms, fms.] 


63 FRT FRA FRB FRC 28 |Rc 
0 6 11 16 21 26 31 


FRT,FRA,FRC,FRB 
FRT,FRA,FRC,FRB 


fmsub 
fmsub. 


fmsubs 
fmsubs. 


59 FRT FRA FRB FRC 28 |Rc 
0 6 14 16 21 26 31 


The operation 
FRT ¢ [(FRA)x(FRC)] — (FRB) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is sub- 
tracted from this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX OX UX XX 

VXSNAN VXISI VXIMZ 

CR1 (if Rc=1) 
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Floating Negative Multiply-Add [ Single] 
A-form 


fnmadd 
fnmadd. 


FRT,FRA,FRC,FRB 
FRT,FRA,FRC,FRB 


[POWER mnemonics: fnma, fnma.] 


63 FRT FRA FRB FRC 31 |Re 


0 6 11 16 21 26 31 
fnmadds = FRT,FRA,FRC,FRB (Rc=0) 
fnmadds. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 31 |Re 
0 6 11 16 21 26 31 


The operation 
FRT ¢ — ([(FRA)x(FRC)] + (FRB) ) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is added to 
this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR, then 
negated and placed into register FRT. 


This instruction produces the same result as would be 
obtained by using the Floating Multiply-Add instruc- 
tion and then negating the result, with the following 
exceptions. 


m QNaNs propagate with no effect on their “sign” 
bit. 

m= QNaNs that are generated as the result of a disa- 
bled Invalid Operation Exception have a “sign” bit 
of 0. 

m SNaNs that are converted to QNaNs as the result 
of a disabled Invalid Operation Exception retain 
the “sign” bit of the SNaN. 


FPSCRrppr is set to the class and sign of the result, 
except for Invalid Operation Exceptions when 
FPSCRye= 1. 


Special Registers Altered: 
FPRF FR FI 
FX OX UX XX 
VXSNAN VXISI VXIMZ 
CR1 (if Re=1) 


Floating Negative Multiply-Subtract 
[Single] A-form 


fnmsub FRT,FRA,FRC,FRB (Rc=0) 
fnmsub. FRT,FRA,FRC,FRB (Rc=1) 
[POWER mnemonics: fnms, fnms.] 

63 FRT FRA FRB FRC 30 |Rec 
0 6 11 16 21 26 31 
fnmsubs FRT,FRA,FRC,FRB (Rc=0) 
fnmsubs. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 30 |Re 
0 6 11 16 21 26 31 


The operation 
FRT ¢ — ([(FRA)x(FRC)] — (FRB) ) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is sub- 
tracted from this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR, then 
negated and placed into register FRT. 


This instruction produces the same result as would be 
obtained by using the Floating Multiply-Subtract 
instruction and then negating the result, with the fol- 
lowing exceptions. 


m QNaNs propagate with no effect on their “sign” 
bit. 

m= QNaNs that are generated as the result of a disa- 
bled Invalid Operation Exception have a “sign” bit 
of 0. 

m= SNaNs that are converted to QNaNs as the result 
of a disabled Invalid Operation Exception retain 
the “sign” bit of the SNaN. 


FPSCRrppr is set to the class and sign of the result, 
except for Invalid Operation Exceptions when 
FPSCRye= 1. 


Special Registers Altered: 
FPRF FR FI 
FX OX UX XX 
VXSNAN VXISI VXIMZ 
CR1 (if Re=1) 
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4.6.6 Floating-Point Rounding and Conversion Instructions 


Programming Note 





Examples of uses of these instructions to perform 
various conversions can be found in Section C.2, 
“Floating-Point Conversions” on page 158. 





Floating Round to Single-Precision 
X-form 


frsp FRT,FRB (Rc=0) 
frsp. FRT,FRB (Rc=1) 


63 FRT //I FRB 12 Re 
0 6 11 16 21 31 


The floating-point operand in register FRB is rounded 
to single-precision, using the rounding mode specified 
by FPSCRan, and placed into register FRT. 


The rounding is described fully in Section A.1, 
“Floating-Point Round to Single-Precision Model” on 
page 133. 


FPSCRrppr is set to the class and sign of the result, 
except for Invalid Operation Exceptions when 
FPSCRyep=1. 


Special Registers Altered: 
FPRF FR FI 
FX OX UX XX 
VXSNAN 
CR1 (if Re=1) 
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Floating Convert To Integer Doubleword 
X-form 


fctid FRT,FRB (Rc=0) 
fetid. FRT,FRB (Rc=1) 


63 FRT //I FRB 814 Re 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 64-bit signed fixed-point integer, using the 
rounding mode specified by FPSCRan, and placed into 
register FRT. 


If the operand in FRB is greater than 2®- 1, then 
FRT is set to Ox/7FFF_FFFF_FFFF_FFFF. If the 
operand in FRB is less than — 26, then FRT is set to 
0x8000_0000_0000_0000. 


The conversion is described fully in Section A.2, 
“Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCReppr is undefined. FPSCRep is set if the result 
is incremented when rounded. FPSCRe is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Re=1) 


Floating Convert To Integer Doubleword 
with round toward Zero X-form 


fctidz FRT,FRB 
fctidz. FRT,FRB 


63 FRT HI FRB 815 Re 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 64-bit signed fixed-point integer, using the 
rounding mode Round toward Zero, and placed into 
register FRT. 


If the operand in FRB is greater than 2- 1, then 
FRT is set to Ox/7FFF_FFFF_FFFF_FFFF. If the 
operand in FRB is less than — 2, then FRT is set to 
0x8000_0000_0000_0000. 


The conversion is described fully in Section A.2, 
“Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCReprr is undefined. FPSCRep is set if the result 
is incremented when rounded. FPSCRr, is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Re=1) 
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Floating Convert To Integer Word 
X-form 


fctiw FRT,FRB (Rc=0) 
fctiw. FRT,FRB (Rc=1) 
[POWER2 mnemonics: fcir, fcir.] 

63 FRT /// FRB 14 Re 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 32-bit signed fixed-point integer, using the 
rounding mode specified by FPSCRan, and placed into 
FRT39.63- The contents of FRTo.3, are undefined. 


If the operand in FRB is greater than 29! — 1, then bits 
32:63 of FRT are set to Ox7FFF_FFFF. If the operand 
in FRB is less than — 25’, then bits 32:63 of FRT are 
set to 0x8000_0000. 


The conversion is described fully in Section A.2, 
“Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCReppr is undefined. FPSCRep is set if the result 
is incremented when rounded. FPSCRe, is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Re=1) 


Floating Convert To Integer Word with 
round toward Zero X-form 


fctiwz FRT,FRB (Rc=0) 
fctiwz. FRT,FRB (Rc=1) 
[POWER2 mnemonics: fcirz, fcirz.] 

63 FRT /// FRB 15 Re 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 32-bit signed fixed-point integer, using the 
rounding mode Round toward Zero, and placed into 
FRT39.63- The contents of FRTo.3, are undefined. 


If the operand in FRB is greater than 2' — 1, then bits 
32:63 of FRT are set to OxX7FFF_FFFF. If the operand 
in FRB is less than — 23!, then bits 32:63 of FRT are 
set to 0x8000_0000. 


The conversion is described fully in Section A.2, 
“Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCReppr is undefined. FPSCRep is set if the result 
is incremented when rounded. FPSCRr, is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Re=1) 
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Floating Convert From Integer 
Doubleword X-form 


fcfid FRT,FRB (Re=0) 
fefid. FRT,FRB (Ro=1) 


63 FRT HI FRB 846 Re 
0 6 11 16 21 31 


The 64-bit signed fixed-point operand in register FRB 
is converted to an infinitely precise floating-point 
integer. The result of the conversion is rounded to 
double-precision, using the rounding mode specified 
by FPSCRan, and placed into register FRT. 


The conversion is described fully in Section A.3, 
“Floating-Point Convert from Integer Model” on 
page 141. 


FPSCRrppr is set to the class and sign of the result. 
FPSCRre is set if the result is incremented when 
rounded. FPSCRg, is set if the result is inexact. 


Special Registers Altered: 
FPRF FR FI 
FX XX 
CR1 (if Re=1) 
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4.6.7 Floating-Point Compare Instructions 


The floating-point Compare instructions compare the 
contents of two floating-point registers. Comparison 
ignores the sign of zero (i.e., regards +0 as equal to 
—0). The comparison can be ordered or unordered. 


The comparison sets one bit in the designated CR 
field to 1 and the other three to 0. The FPCC is set in 
the same way. 


Floating Compare Unordered X-form 


fempu BF,FRA,FRB 


63 BF | //| FRA FRB 0 / 
0 6 Q |i 16 21 31 


if (FRA) is a NaN or 

(FRB) is a NaN then c € 0b0001 
else if (FRA) < (FRB) then c ¢ 0b1000 
else if (FRA) > (FRB) then c ¢ 0b0100 
else c € 0b0010 
FPCC ¢c¢ 
CRaxBF:4xBF+3 © C 
if (FRA) is an SNaN or 

(FRB) is an SNaN then 

VXSNAN € 1 


The floating-point operand in register FRA is com- 
pared to the floating-point operand in register FRB. 
The result of the compare is placed into CR field BF 
and the FPCC. 


If either of the operands is a NaN, either quiet or sig- 
naling, then CR field BF and the FPCC are set to 
reflect unordered. If either of the operands is a Sig- 
naling NaN, then VXSNAN is set. 


Special Registers Altered: 
CR field BF 
FPCC 
FX 
VXSNAN 


The CR field and the FPCC are set as follows. 


Bit Name _ Description 

0 FL (FRA) < (FRB) 

1 FG (FRA) > (FRB) 

2 FE (FRA) = (FRB) 

3 FU (FRA) ? (FRB) (unordered) 


Floating Compare Ordered X-form 


fempo BF,FRA,FRB 


63 BF | //| FRA FRB 32 / 
0 6 9 |11 16 21 31 


if (FRA) is a NaN or 
(FRB) is a NaN then c € 0b0001 
else if (FRA) < (FRB) then c ¢ 0b1000 
else if (FRA) > (FRB) then c ¢ 0b0100 
else c € 0b0010 
FPCC ¢¢ 
CRaxBF:4xBF+3 © C 
if (FRA) is an SNaN or 
(FRB) is an SNaN then 
VXSNAN € 1 
if VE = 0 then VxvVC #1 
else if (FRA) is a QNaN or 
(FRB) is a QNaN then VXVC #1 


The floating-point operand in register FRA is com- 
pared to the floating-point operand in register FRB. 
The result of the compare is placed into CR field BF 
and the FPCC. 


If either of the operands is a NaN, either quiet or sig- 
naling, then CR field BF and the FPCC are set to 
reflect unordered. If either of the operands is a Sig- 
naling NaN, then VXSNAN is set and, if Invalid Opera- 
tion is disabled (VE=0), VXVC is set. If neither 
operand is a Signaling NaN but at least one operand 
is a Quiet NaN, then VXVC is set. 


Special Registers Altered: 
CR field BF 
FPCC 
FX 
VXSNAN VXVC 
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4.6.8 Floating-Point Status and Control Register Instructions 


Every Floating-Point Status and Control Register 
instruction synchronizes the effects of all floating- 
point instructions executed by a given processor. 
Executing a Floating-Point Status and Control Register 
instruction ensures that all floating-point instructions 
previously initiated by the given processor have com- 
pleted before the Floating-Point Status and Control 
Register instruction is initiated, and that no subse- 
quent floating-point instructions are initiated by the 
given processor until the Floating-Point Status and 
Contro! Register instruction has completed. In partic- 
ular: 


a All exceptions that will be caused by the previ- 
ously initiated instructions are recorded in the 


Move From FPSCR X-form 


mffs FRT (Re=0) 
mits. FRT (Rc=1) 

63 FRT Mit MI 583 Re 
0 6 i 16 21 31 


The contents of the FPSCR are placed into FRT30.¢. 
The contents of FRT9.3; are undefined. 


Special Registers Altered: 
CR1 (if Re=1) 


FPSCR before the Floating-Point Status and 
Contro! Register instruction is initiated. 


a All invocations of the system floating-point 
enabled exception error handler that will be 
caused by the previously initiated instructions 
have occurred before the Floating-Point Status 
and Control Register instruction is initiated. 


= No subsequent floating-point instruction that 
depends on or alters the settings of any FPSCR 
bits is initiated until the Floating-Point Status and 
Control Register instruction has completed. 


(Floating-point Storage Access instructions are not 
affected.) 


Move to Condition Register from FPSCR 
X-form 


merfs BF,BFA 


63 BF |//| BFA}//)  /// 64 / 
0 6 9 | 11 14] 16 21 31 


The contents of FPSCR field BFA are copied to Condi- 
tion Register field BF. All exception bits copied are 
set to 0 in the FPSCR. If the FX bit is copied, it is set 
to 0 in the FPSCR. 


Special Registers Altered: 


CR field BF 
FX OX (if BEA=0) 
UX ZX XX VXSNAN (if BEA=1) 
VXISI VXIDI VXZDZ VXIMZ (if BEA=2) 
VXVC (if BEA=3) 
VXSOFT VXSQRT VXCVI (if BFA=5) 
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Move To FPSCR Field Immediate 
X-form 


mtfsfi BF,U 
mifsfi. BF,U 


63 BF }//|  /// U |/ 134 Re 
0 6 9} 44 16 2q 21 31 


The value of the U field is placed into FPSCR field BF. 
FPSCRry is altered only if BF = 0. 
Special Registers Altered: 


FPSCR field BF 
CRt (if Re=1) 





Programming Note 


When FPSCR_-3 is specified, bits 0 (FX) and 3 (OX) 
are set to the values of Uy and Us (i.e., even if 
this instruction causes OX to change from 0 to 1, 
FX is set from Up and not by the usual rule that 
FX is set to 1 when an exception bit changes from 
0 to 1). Bits 1 and 2 (FEX and VX) are set 
according to the usual rule, given on page 83, and 
not from Uj.5. 








Move To FPSCR Fields XFL-form 


mtfsf FLM,FRB (Rc=0) 
mtfsf. FLM,FRB (Re=1) 

63 |/| FLM_ |/| FRB 711 Re 
0 6) 7 19 16 21 31 


The contents of bits 32:63 of register FRB are placed 
into the FPSCR under control of the field mask speci- 
fied by FLM. The field mask identifies the 4-bit fields 
affected. Let i be an integer in the range 0-7. If 
FLM,;=1 then FPSCR field i (FPSCR bits 4xi:4xi+3) is 
set to the contents of the corresponding field of the 
low-order 32 bits of register FRB. 


FPSCRry is altered only if FLMy = 1. 
Special Registers Altered: 
FPSCR fields selected by mask 
CR1 (if Re=1) 


Programming Note 


Updating fewer than all eight fields of the FPSCR 
may have substantially poorer performance on 
some implementations than updating all the fields. 








Programming Note 


When FPSCR,-3 is specified, bits 0 (FX) and 3 (OX) 
are set to the values of (FRB)35. and (FRB)zz (i-e., 
even if this instruction causes OX to change from 
0 to 1, FX is set from (FRB)35 and not by the usual 
rule that FX is set to 1 when an exception bit 
changes from 0 to 1). Bits 1 and 2 (FEX and VX) 
are set according to the usual rule, given on page 
83, and not from (FRB)33.34. 
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Move To FPSCR Bit 0 X-form 


mtfsb0 BT (Rc=0) 
mtfsb0. BT (Rc=1) 


63 BT HI HI 70 Re 
0 6 11 16 21 31 


Bit BT of the FPSCR is set to 0. 


Special Registers Altered: 
FPSCR bit BT 
CR1 (if Re=1) 


Programming Note 


Bits 1 and 2 (FEX and VX) cannot be explicitly 
reset. 












Move To FPSCR Bit 1 X-form 


mtfsb1 BT 
mtfsb1. BT 


63 BT HI HI 38 Re 
0 6 11 16 21 31 


Bit BT of the FPSCR is set to 1. 
Special Registers Altered: 


FPSCR bits BT and FX 
CRt (if Re=1) 


Programming Note 
Bits 1 and 2 (FEX and VX) cannot be explicitly set. 
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5.1 Fixed-Point Processor Instructions 


5.1.1 Move To/From System Register Instructions 


The optional versions of the Move To Condition Reg- 
ister Field and Move From Condition Register 
instructions move to or from a single CR field. 


Move To One Condition Register Field 
XFX-form 


mtocrf FXM,RS 
0 6 11] 12 20/21 31 
count ¢ 0 
do i=0to7 
if FXM, = 1 then 
n¢i 


count € count + 1 
if count = 1 then CRarnacns3 € (RS) 3244xn:3244xn+3 
else CR ¢ undefined 


If exactly one bit of the FXM field is set to 1, let n be 
the position of that bit in the field (0 < n< 7). The 
contents of bits 32+4xn:32+4xn+3 of register RS 
are placed into CR field n (CR bits 4xn:4xn+3). Oth- 
erwise, the contents of the Condition Register are 
undefined. 


Special Registers Altered: 
CR field selected by FXM 


Programming Note 


These forms of the mterf and mfer instructions are 
intended to replace the old forms of the 
instructions (the forms shown in Section 3.3.13), 
which will eventually be phased out of the archi- 
tecture. The new forms are backward compatible 
with most processors that comply with versions of 
the architecture that precede Version 2.00. On 
those processors, the new forms are treated as 
the old forms. 


However, on some processors that comply with 
versions of the architecture that precede Version 
2.00 the new forms may be treated as follows: 


mtocrf: may cause the system illegal instruction 
error handler to be invoked 

mfocrf: may copy the contents of an SPR, pos- 
sibly a privileged SPR, into register RT 








Move From One Condition Register 
Field XFX-form 


mfocrf RT,FXM 


| 
0 6 11112 20/21 31 


RT € undefined 


count ¢€ 0 
do i= 0 to7 
if FXM, = 1 then 
n¢i 


__ count € count + 1 
if count = 1 then Rlgoyayn:32eaxn+3 € CRaxn:axns3 


If exactly one bit of the FXM field is set to 1, let n be 
the position of that bit in the field (0< n< 7). The 
contents of CR field n (CR bits 4xn:4xn+3) are placed 
into bits 32+4xn:32+4xn+3 of register RT and the 
contents of the remaining bits of register RT are 
undefined. Otherwise, the contents of register RT are 
undefined. 


Special Registers Altered: 
None 
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5.2 Floating-Point Processor Instructions 


The optional instructions described in this section are = General Purpose group: fsqrt, fsqrts 
divided into two groups. Additional groups may be = Graphics group: fres, frsqrte, fsel 


BSUNS ESI Teun: An implementation that claims to support a given 


group implements all the instructions in the group. 
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5.2.1 Floating-Point Arithmetic Instructions 


5.2.1.1 Floating-Point Elementary Arithmetic Instructions 


Floating Square Root [Single] A-form 


fsqrt FRT,FRB (Rc=0) 
fsqrt. FRT,FRB (Rc=1) 

63 FRT /// FRB If 22 |Rc 
0 6 11 16 21 26 31 
fsqrts FRT,FRB (Rc=0) 
fsqrts. FRT,FRB (Rc=1) 


59 FRT HI FRB //1 22 |Rc 
0 6 11 16 21 26 31 


The square root of the floating-point operand in reg- 
ister FRB is placed into register FRT. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Operation with various special values of the operand 
is summarized below. 


Operand Result Exception 
co QNaNn! VXSORT 

< 0 QNaN! VXSORT 

—0 —0 None 

+00 +00 None 

SNaN QNaNn! VXSNAN 
QNaN QNaN None 


'No result if FPSCRye = 1. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyep= 1. 
Special Registers Altered: 

FPRF FR Fl 

FX XX 

VXSNAN VXSQRT 

CR1 (if Rc=1) 


Floating Reciprocal Estimate Single 
A-form 


fres FRT,FRB (Ro=0) 
fres. FRT,FRB (Ro=1) 


59 FRT HI FRB //1 24 |Rec 
0 6 11 16 21 26 31 


A single-precision estimate of the reciprocal of the 
floating-point operand in register FRB is placed into 
register FRT. The estimate placed into register FRT 
is correct to a precision of one part in 256 of the 
reciprocal of (FRB), i.e., 


estimate — 1/x 1 
Fen ee < jae 
aBs( 1/x ) ~ 256 


where x is the initial value in FRB. Note that the 
value placed into register FRT may vary between 
implementations, and between different executions on 
the same implementation. 


Operation with various special values of the operand 
is summarized below. 


Operand Result Exception 
—0o —0 None 

0 od 2X 

+0 tod 2X 

+00 +0 None 

SNaN QNaNn? VXSNAN 
QNaN QNaN None 


'No result if FPSCRz¢ = 1. 
®No result if FPSCRyp = 1. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCR7¢=1. 


Special Registers Altered: 
FPRF FR (undefined) FI (undefined) 
FX OX UX ZX 
VXSNAN 
CR1 (if Re=1) 
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Floating Reciprocal Square Root 
Estimate A-form 


frsqrte 
frsqrte. 


FRT,FRB 
FRT,FRB 


63 FRT HI FRB HI 26 |Rc 
0 6 11 16 21 26 31 


A double-precision estimate of the reciprocal of the 
square root of the floating-point operand in register 
FRB is placed into register FRT. The estimate placed 
into register FRT is correct to a precision of one part 
in 32 of the reciprocal of the square root of (FRB), i-e., 


estimate — 1/V x. { 
ABS — < =~ 
vx 32 


where x is the initial value in FRB. Note that the 
value placed into register FRT may vary between 
implementations, and between different executions on 
the same implementation. 





Operation with various special values of the operand 
is summarized below. 





Operand Result Exception 
oo QNaN@ VXSORT 

< 0 QNaN?2 VXSQRT 

-0 od 2X 

+0 tod 2X 

too +0 None 

SNaN QNaNn?2 VXSNAN 
QNaN QNaN None 


'No result if FPSCRz_- = 1. 
2No result if FPSCRye = 1. 


FPSCRrppr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCR7¢=1. 


Special Registers Altered: 
FPRF FR (undefined) FI (undefined) 
FX ZX 
VXSNAN VXSQRT 
CR1 (if Re=1) 


5.2.2 Floating-Point Select 
Instruction 


Floating Select A-form 


fsel FRT,FRA,FRC,FRB 
fsel. FRT,FRA,FRC,FRB 


63 FRT FRA FRB FRC 23 |Rc 
0 6 14 16 21 26 31 


if (FRA) = 0.0 then FRT ¢ (FRC) 
else FRT ¢€ (FRB) 


The floating-point operand in register FRA is com- 
pared to the value zero. If the operand is greater 
than or equal to zero, register FRT is set to the con- 
tents of register FRC. If the operand is less than zero 
or is a NaN, register FRT is set to the contents of reg- 
ister FRB. The comparison ignores the sign of zero 
(i.e., regards +0 as equal to — 0). 


Special Registers Altered: 
CR1 (if Re=1) 








Programming Note 


Examples of uses of this instruction can be found 
in Sections C.2, “Floating-Point Conversions” on 
page 158 and C.3, “Floating-Point Selection” on 
page 160. 


Warning: Care must be taken in using fsel if IEEE 
compatibility is required, or if the values being 
tested can be NaNs or infinities; see Section C.3.4, 
“Notes” on page 160. 
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5.3 Little-Endian 


It is computed that eleven Thousand Persons have, at several Times, suffered Death, rather than submit 
to break their Eggs at the smaller End. Many hundred large Volumes have been published upon this 


Controversy .... 


The Little-Endian facility permits a program to access 
storage using Little-Endian byte ordering. 


5.3.1 Byte Ordering 


If scalars (individual data items and instructions) were 
indivisible, then there would be no such concept as 
“byte ordering”. It is meaningless to talk of the 
“order” of bits or groups of bits within the smallest 
addressable unit of storage, because nothing can be 
observed about such order. Only when scalars, which 
the programmer and processor regard as indivisible 
quantities, can be made up of more than one address- 
able unit of storage does the question of “order” 
arise. 


For a machine in which the smallest addressable unit 
of storage is the 64-bit doubleword, there is no ques- 
tion of the ordering of “bytes” within doublewords. 
All transfers of individual scalars to and from storage 
(e.g., between registers and_ storage) are of 
doublewords, and the address of the “byte” con- 
taining the high-order 8 bits of a scalar is no different 
from the address of a “byte” containing any other 
part of the scalar. 


For PowerPC, as for most computers currently avail- 
able, the smallest addressable unit of storage is the 
8-bit byte. Many scalars are halfwords, words, or 
doublewords, which consist of groups of bytes. When 
a word-length scalar is moved from a register to 
storage, the scalar occupies four consecutive byte 
addresses. It thus becomes meaningful to discuss the 
order of the byte addresses with respect to the value 
of the scalar: which byte contains the highest-order 8 
bits of the scalar, which byte contains the next- 
highest-order 8 bits, and so on. 


Given a scalar that spans multiple bytes, the choice of 
byte ordering is essentially arbitrary. There are 
4!= 24 ways to specify the ordering of four bytes 
within a word, but only two of these orderings are 
sensible: 


m= The ordering that assigns the lowest address to 
the highest-order (“leftmost”) 8 bits of the scalar, 


Jonathan Swift, Gulliver's Travels 


the next sequential address to the next-highest- 
order 8 bits, and so on. This is called Big-Endian 
because the “big end” of the scalar, considered 
as a binary number, comes first in storage. IBM 
RISC System/6000, IBM _ System/370, and 
Motorola 680x0 are examples of computers using 
this byte ordering. 


m= The ordering that assigns the lowest address to 
the lowest-order (“rightmost”) 8 bits of the scalar, 
the next sequential address to the next-lowest- 
order 8 bits, and so on. This is called Little- 
Endian because the “little end” of the scalar, 
considered as a binary number, comes first in 
storage. DEC VAX and Intel x86 are examples of 
computers using this byte ordering. 


5.3.2 Structure Mapping Examples 


Figure 39 on page 123 shows an example of a C lan- 
guage structure s containing an assortment of scalars 
and one character string. The value assumed to be in 
each structure element is shown in hex in the C com- 
ments; these values are used below to show how the 
bytes making up each structure element are mapped 
into storage. 


C structure mapping rules permit the use of padding 
(skipped bytes) in order to align the scalars on desir- 
able boundaries. Figures 40 and 41 show each scalar 
aligned at its natural boundary. This alignment intro- 
duces padding of four bytes between a and b, one 
byte between d and e, and two bytes between e and f. 
The same amount of padding is present for both Big- 
Endian and Little-Endian mappings. 


5.3.2.1 Big-Endian Mapping 


The Big-Endian mapping of structure s is shown in 
Figure 40. Addresses are shown in hex at the left of 
each doubleword, and in small figures below each 
byte. The contents of each byte, as indicated in the C 
example in Figure 39, are shown in hex (as characters 
for the elements of the string). 
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struct { 
int a; /* 0x1112_1314 word */ 
double b; /* 0x2122_ 2324 2526 2728 doubleword */ 
char * oc; /* 0x3132_3334 word */ 
char d(7]; /* ‘A’, 'B', 'C', 'D', 'E', 'F', 'G' array of bytes */ 
short e; /*  0x5152 halfword ep 
int f; /* 0x6162_6364 word */ 

} s; 


Figure 39. C structure 's', showing values of elements 


08 
10 


18 





20 


Figure 40. Big-Endian mapping of structure 's' 


5.3.2.2 Little-Endian Mapping 


The same structure s is shown mapped Little-Endian 
in Figure 41. Doublewords are shown laid out from 
right to left, which is the common way of showing 
storage maps for Little-Endian machines. 


11 612 «#613 ~=«14 00 


pr oar mpegs 10 
13 10 


61 62 63 64 20 





Figure 41. Little-Endian mapping of structure 's' 


5.3.3 PowerPC Byte Ordering 


The body of each of the three PowerPC Architecture 
Books, Book I, PowerPC User Instruction Set Architec- 
ture, Book Il, PowerPC Virtual Environment Architec- 
ture, and Book Ill, PowerPC Operating Environment 
Architecture, is written as if a PowerPC system runs 
only in Big-Endian mode. In fact, a PowerPC system 
can instead run in Little-Endian mode, in which the 
instruction set behaves as if the byte ordering were 
Little-Endian, and can change Endian mode dynam- 
ically. The remainder of Section 5.3 describes how 


the mode is controlled, and how running in Little- 
Endian mode differs from running in Big-Endian mode. 


5.3.3.1 Controlling PowerPC Byte 
Ordering 


The Endian mode of a PowerPC processor is con- 
trolled by two bits: the LE (Little-Endian Mode) bit 
specifies the current mode of the processor, and the 
ILE (Interrupt Little-Endian Mode) bit specifies the 
mode that the processor enters when the system 
error handler is invoked. For both bits, a value of 0 
specifies Big-Endian mode and a value of 1 specifies 
Little-Endian mode. The location of these bits and the 
requirements for altering them are described in Book 
Ill, PowerPC Operating Environment Architecture. 


When a PowerPC system comes up after power-on- 
reset, Big-Endian mode is in effect (see Book Ill). 
Thereafter, methods described in Book III can be used 
to change the mode, as can both invoking the system 
error handler and returning from the system error 
handler. 


Programming Note 


For a discussion of software synchronization 
requirements when altering the LE and ILE bits, 
see Book Ill (e.g., to the chapter entitled “Syn- 
chronization Requirements for Context 
Alterations” in Book III). 








5.3.3.2 PowerPC Little-Endian Byte 
Ordering 


One might expect that a PowerPC system running in 
Little-Endian mode would have to perform a 2-way, 
4-way, or 8-way byte swap when transferring a 
halfword, word, or doubleword to or from storage, 
e.g., when transferring data between storage and a 
General Purpose Register or Floating-Point Register, 
when fetching instructions, and when transferring data 
between storage and an Input/Output (I/O) device. 
PowerPC systems do not do such swapping, but 
instead achieve the effect of Little-Endian byte 
ordering by modifying the low-order three bits of the 
effective address (EA) as described below. Individual 
scalars actually appear in storage in Big-Endian byte 
order. 
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The modification affects only the addresses presented 
to the storage subsystem (see Book Ill, PowerPC 
Operating Environment Architecture). All effective 
addresses in architecturally defined registers, as well 
as the Current Instruction Address (CIA) and Next 
Instruction Address (NIA), are independent of Endian 
mode. For example: 


mu The effective address placed into the Link Reg- 
ister by a Branch instruction with LK=1 is equal 
to the CIA of the Branch instruction + 4. 


m The effective address placed into RA by a 
Load/Store with Update instruction is the value 
computed as_ described in the instruction 
description. 


m= The effective addresses placed into System Reg- 
isters when the system error handler is invoked 
(e.g., SRRO, DAR; see Book Ill, PowerPC Oper- 
ating Environment Architecture) are those that 
were computed or would have been computed by 
the interrupted program. 


The modification is performed regardless of whether 
address translation is enabled or disabled and, if 
address translation is enabled, regardless of the 
translation mechanism used (see Book Ill, PowerPC 
Operating Environment Architecture). The actual 
transfer of data and instructions to and from storage 
is unaffected (and thus unencumbered by multiplexors 
for byte swapping). 


The modification of the low-order three bits of the 
effective address in Little-Endian mode is done as 
follows, for access to an individual aligned scalar. 
(Alignment is as determined before this modification.) 
Access to an individual unaligned scalar or to multiple 
scalars is described in subsequent sections, as is 
access to certain architecturally defined data in 
storage, data in caches (e.g., see Book II, PowerPC 
Virtual Environment Architecture, and Book Ill, 
PowerPC Operating Environment Architecture), etc. 


In Little-Endian mode, the effective address is com- 
puted in the same way as in Big-Endian mode. Then, 
in Little-Endian mode only, the low-order three bits of 
the effective address are Exclusive ORed with a 
three-bit value that depends on the length of the 
operand (1, 2, 4, or 8 bytes), as shown in Table 2. 
This modified effective address is then presented to 
the storage subsystem, and data of the specified 
length are transferred to or from the addressed (as 
modified) storage locations(s). 








Data Length (bytes) EA Modification 

1 XOR with 0b111 
XOR with 06110 
XOR with 0b100 


(no change) 














Table 2. PowerPC Little-Endian, effective address 
modification for individual aligned scalars 


The effective address modification makes it appear to 
the processor that individual aligned scalars are 
stored Little-Endian, while in fact they are stored Big- 
Endian but in different bytes within doublewords from 
the order in which they are stored in Big-Endian 
mode. 


For example, in Little-Endian mode structure s would 
be placed in storage as follows, from the point of view 


of the storage subsystem (i.e., after the effective 
address modification described above). 


08 


10 'p! Loa 'B! ‘Al 


11 12 13 


18 


20 





Figure 42. PowerPC Little-Endian, structure 's' in 


storage subsystem 


Figure 42 is identical to Figure 41 except that the byte 
numbers within each doubleword are reversed. (This 
identity is in some sense an artifact of depicting 
storage as a sequence of doublewords. If storage is 
instead depicted as a sequence of words, a single 
byte stream, etc., then no such identity appears. 
However, regardless of the unit in which storage is 
depicted or accessed, the address of a given byte in 
Figure 42 differs from the address of the same byte 
in Figure 41 only in the low-order three bits, and the 
sum of the two 3-bit values that comprise the low- 
order three bits of the two addresses is equal to 7. 
Depicting storage as a sequence of doublewords 
makes this relationship easy to see.) 


Because of the modification performed on effective 
addresses, structure S$ appears to the processor to be 
mapped into storage as follows when the processor is 
in Little-Endian mode. 
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00 
08 
10 
18 
1A 19-18 
61 62 63 64 20 
23. 222120 
Figure 43. PowerPC Little-Endian, structure ‘s' as 


seen by processor 


Notice that, as seen by the program executing in the 
processor, the mapping for structure s is identical to 
the Little-Endian mapping shown in Figure 41. Froma 
point of view outside the processor, however, the 
addresses of the bytes making up structure s are as 
shown in Figure 42. These addresses match neither 
the Big-Endian mapping of Figure 40 nor the Little- 
Endian mapping of Figure 41; allowance must be 
made for this in certain circumstances (e.g., when 
performing I/O; see Section 5.3.7). 


The following four sections describe in greater detail 
the effects of running in Little-Endian mode on 
accessing data, on fetching instructions, on explicitly 
accessing the caches and any address translation 
lookaside buffers (e.g., see Book Il, PowerPC Virtual 
Environment Architecture, and Book Ill, PowerPC 
Operating Environment Architecture), and on doing 
/0. 


5.3.4 PowerPC Data Addressing in 
Little-Endian Mode 


5.3.4.1 Individual Aligned Scalars 


When the storage operand is aligned for any instruc- 
tion in the following classes, the effective address 
presented to the storage subsystem is computed as 
described in Section 5.3.3.2: Fixed-Point Load, Fixed- 
Point Store, Load and Store with Byte Reversal, 
Floating-Point Load, Floating-Point Store (including 
stfiwx), and Load And Reserve and Store Conditional 
(see Book Il). 


The Load and Store with Byte Reversal instructions 
have the effect of loading or storing data in the oppo- 
site Endian mode from that in which the processor is 
running. That is, data are loaded or stored in Little- 
Endian order if the processor is running in Big-Endian 
mode, and in Big-Endian order if the processor is 
running in Little-Endian mode. 


5.3.4.2 Other Scalars 


As described below, the system alignment error 
handler may be (see the subsection entitled “Indi- 
vidual Unaligned Scalars”) or is (see the subsection 
entitled “Multiple Scalars”) invoked if attempt is made 
in Little-Endian mode to execute any of the 
instructions described in the following two sub- 
sections. 


Individual Unaligned Scalars 


The “trick” of Exclusive ORing the low-order three bits 
of the effective address of an individual scalar does 
not work unless the scalar is aligned. In Little-Endian 
mode, PowerPC processors may cause the system 
alignment error handler to be invoked whenever any 
of the Load or Store instructions listed in Section 
5.3.4.1 is issued with an unaligned effective address, 
regardless of whether such an access could be 
handled without invoking the system alignment error 
handler in Big-Endian mode. 


PowerPC processors are not required to invoke the 
system alignment error handler when an unaligned 
access is attempted in Little-Endian mode. The imple- 
mentation may handle some or all such accesses 
without invoking the system alignment error handler, 
just as in Big-Endian mode. The architectural require- 
ment is that halfwords, words, and doublewords be 
placed in storage such that the Little-Endian effective 
address of the lowest-order byte is the effective 
address computed by the Load or Store instruction, 
the Little-Endian address of the next-lowest-order byte 
is one greater, and so on. (Load And Reserve and 
Store Conditional differ somewhat from the rest of the 
instructions listed in Section 5.3.4.1, in that neither 
the implementation nor the system alignment error 
handler is expected to handle these four instructions 
“correctly” if their operands are not aligned.) 


Figure 44 shows an example of a word w stored at 
Little-Endian address 5. The word is assumed to 
contain the binary value 0x1112_1314. 





Figure 44. Little-Endian mapping of word 'w' stored 
at address 5 


In Little-Endian mode word w would be placed in 
storage as follows, from the point of view of the 
storage subsystem (i.e., after the effective address 
modification described in Section 5.3.3.2). 
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Figure 45. PowerPC Little-Endian, word 'w' stored at 
address 5 in storage subsystem 


Notice that the unaligned word w in Figure 45 spans 
two doublewords. The two parts of the unaligned 
word are not contiguous as seen by the storage sub- 
system. 


An implementation may choose to support some but 
not all unaligned Little-Endian accesses. For example, 
an unaligned Little-Endian access that is contained 
within a single doubleword may be supported, while 
one that spans doublewords may cause the system 
alignment error handler to be invoked. 


Multiple Scalars 


PowerPC has two classes of instructions that handle 
multiple scalars, namely the Load and Store Multiple 
instructions and the Move _ Assist instructions. 
Because both classes of instructions potentially deal 
with more than one word-length scalar, neither class 
is amenable to the effective address modification 
described in Section 5.3.3.2 (e.g., pairs of aligned 
words would be accessed in reverse order from what 
the program would expect). Attempting to execute 
any of these instructions in Little-Endian mode causes 
the system alignment error handler to be invoked. 


5.3.4.3 Page Table 


The layout of the Page Table in storage (see Book Ill, 
PowerPC Operating Environment Architecture) is 
independent of Endian mode. A given byte in the 
Page Table must be accessed using an effective 
address appropriate to the mode of the executing 
program (e.g., the high-order byte of a Page Table 
Entry must be accessed with an effective address 
ending with 0b000 in Big-Endian mode, and with an 
effective address ending with 0b111 in Little-Endian 
mode). 


5.3.5 PowerPC Instruction 
Addressing in Little-Endian Mode 


Each PowerPC instruction occupies an aligned word in 
storage. The processor fetches and executes 
instructions as if the CIA were advanced by four for 
each sequentially fetched instruction. When the 
processor is in Little-Endian mode, the effective 


address presented to the storage subsystem in order 
to fetch an instruction is the value from the CIA, mod- 
ified as described in Section 5.3.3.2 for aligned word- 
length scalars. A Little-Endian program is thus an 
array of aligned Little-Endian words, with each word 
fetched and executed in order (discounting branches 
and invocations of the system error handler). 


Figure 46 shows an example of a small assembly lan- 
guage program p. 


loop: 
cmplwi 15,0 
beg done 
lwzux r4,r5,r6 
add 17,17,74 
subi r5,75,4 
b loop 
done: 
stw r7, total 


Figure 46. Assembly language program 'p' 


The Big-Endian mapping for program p is shown in 
Figure 47 (assuming the program starts at address 0). 


00 ee a r5,0 pe done 


08 nee r4, me r6 ae r7,r7,r4 
10 ai x5, = 4 . blow 
11 16 
18 a stw r7, rea 
18 


Figure 47. Big-Endian mapping of program 'p' 





The same program p is shown mapped Little-Endian 
in Figure 48. 


i done 


ae r7,r7,r4 or r4,r5,r6 08 
5 = HET x5, =a 4 10 
15 11 
cor stw r7, Sai 18 
18 


Figure 48. Little-Endian mapping of program 'p' 


oe se r5,0 00 





In Little-Endian mode program p would be placed in 
storage as follows, from the point of view of the 
storage subsystem (i.e., after the effective address 
modification described in Section 5.3.3.2). 
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00 beq done loop: cmplwi r5,0 


00 01 02 03 04 05 06 07 
lwzux r4,r5,r6 
0C oD OE OF 


08 add r7,r7,r4 


10 subi r5,r5,4 
16 17 
18 done: stw r7,total 
ic 1D 1E 1F 





Figure 49. PowerPC Little-Endian, program 'p' in 
storage subsystem 


Figure 49 is identical to Figure 48 on page 126 
except that the byte numbers within each doubleword 
are reversed. (This identity is in some sense an 
artifact of depicting storage as a sequence of 
doublewords. If storage is instead depicted as a 
sequence of words, a single byte stream, etc., then no 
such identity appears. However, regardless of the 
unit in which storage is depicted or accessed, the 
address of a given byte in Figure 49 differs from the 
address of the same byte in Figure 48 on page 126 
only in the low-order three bits, and the sum of the 
two 3-bit values that comprise the low-order three 
bits of the two addresses is equal to 7. Depicting 
storage as a sequence of doublewords makes this 
relationship easy to see.) 


Each individual machine instruction appears in 
storage as a 32-bit integer containing the value 
described in the instruction description, regardless of 
the Endian mode. This is a consequence of the fact 
that individual aligned scalars are mapped in storage 
in Big-Endian byte order. 


Notice that, as seen by the processor when executing 
program p, the mapping for program p is identical to 
the Little-Endian mapping shown in Figure 48. From a 
point of view outside the processor, however, the 
addresses of the bytes making up program p are as 
shown in Figure 49. These addresses match neither 
the Big-Endian mapping of Figure 47 nor the Little- 
Endian mapping of Figure 48. 


All instruction effective addresses visible to an exe- 
cuting program are the effective addresses that are 
computed by that program or, in the case of the 
system error handler, effective addresses that were 
or could have been computed by the interrupted 
program. These effective addresses are independent 
of Endian mode. Examples for Little-Endian mode 
include the following. 


m An instruction address placed into the Link Reg- 
ister by a Branch instruction with LK=1, or an 
instruction address saved in a System Register 
when the system error handler is invoked, is the 


effective address that a program executing in 
Little-Endian mode would use to access the 
instruction as a data word using a Load instruc- 
tion. 


us An offset in a relative Branch instruction (Branch 
or Branch Conditional with AA=0) reflects the 
difference between the addresses of the branch 
and target instructions, using the addresses that 
a program executing in Little-Endian mode would 
use to access the instructions as data words 
using Load instructions. 


= A target address in an absolute Branch instruc- 
tion (Branch or Branch Conditional with AA=1) is 
the address that a program executing in Little- 
Endian mode would use to access the target 
instruction as a data word using a Load instruc- 
tion. 


m= The storage locations that contain the first set of 
instructions executed by each kind of system 
error handler must be set in a manner consistent 
with the Endian mode in which the system error 
handler will be invoked. (These sets. of 
instructions occupy architecturally defined 
locations; see Book Ill, PowerPC Operating Envi- 
ronment Architecture.) Thus if the system error 
handler is to be invoked in Little-Endian mode, 
the first set of instructions for each kind of 
system error handler must appear in storage, 
from the point of view of the storage subsystem 
(i.e., after the effective address modification 
described in Section 5.3.3.2), with the pair of 
instructions within each doubleword reversed 
from the order in which they are to be executed. 
(If the instructions are placed into storage by a 
program running in the same Endian mode as 
that in which the system error handler will be 
invoked, the appropriate order will be achieved 
naturally.) 





Programming Note 


In general, a given subroutine in storage cannot 
be shared between programs running in different 
Endian modes. This affects the sharing of subrou- 
tine libraries. 





5.3.6 PowerPC Cache Management 
Instructions in Little-Endian Mode 


Instructions for explicitly accessing the caches (see 
Book Il, PowerPC Virtual Environment Architecture) 
are unaffected by Endian mode. (Identification of the 
block to be accessed is not affected by the low-order 
three bits of the effective address.) 
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5.3.7 PowerPC 1/O in Little-Endian 
Mode 


Input/output (I/O), such as writing the contents of a 
large area of storage to disk, transfers a byte stream 
on both Big-Endian and Little-Endian systems. For the 
disk transfer, the first byte of the area is written to 
the first byte of the disk record and so on. 


For a PowerPC system running in Big-Endian mode, 
/O transfers happen “naturally” because the byte 
that the processor sees as byte 0 is the same one 
that the storage subsystem sees as byte 0. 


For a PowerPC system running in Little-Endian mode 
this is not the case, because of the modification of the 
low-order three bits of the effective address when the 
processor accesses storage. In order for I/O transfers 
to transfer byte streams properly, in Little-Endian 
mode I/O transfers must be performed as if the bytes 
transferred were accessed one byte at a time, using 
the address modification described in Section 5.3.3.2 
for single-byte scalars. This does not mean that I/O 
on Little-Endian PowerPC systems must use only 
1-byte-wide transfers; data transfers can be as wide 
as desired, but the order of the bytes transferred 
within doublewords must appear as if the bytes were 
fetched or stored one byte at a time. See the System 
Architecture documentation for a given PowerPC 
system for details on the transfer width and byte 
ordering on that system. 


However, not all I/O done on PowerPC systems is for 
large areas of storage as described above. I/O can 
be performed with certain devices merely by storing 
to or loading from addresses that are associated with 
the devices (the terms “memory-mapped I/O” and 
“programmed I/O” or “PIO” are used for this). For 
such PIO transfers, care must be taken when defining 
the addresses to be used, for these addresses are 
subject to the effective address modification shown in 
Table 2 on page 124. A Load or Store instruction 
that maps to a control register on a device may 
require that the value loaded or stored have its bytes 
reversed; if this is required, the Load and Store with 
Byte Reversal instructions can be used. Any require- 
ment for such byte reversal for a particular I/O device 
register is independent of whether the PowerPC 
system is running in Big-Endian or Little-Endian mode. 


Similarly, the address sent to an I/O device by an 
eciwx or ecowx instruction (see Book Il, PowerPC 
Virtual Environment Architecture) is subject to the 
effective address modification shown in Table 2. 


5.3.8 Origin of Endian 


The terms Big-Endian and Little-Endian come from 
Part |, Chapter 4, of Jonathan Swift's Gulliver's 
Travels. Here is the complete passage, from the 
edition printed in 1734 by George Faulkner in Dublin. 


... our Histories of six Thousand Moons make 
no Mention of any other Regions, than the 
two great Empires of Lilliput and Blefuscu. 
Which two mighty Powers have, as | was 
going to tell you, been engaged in a most 
obstinate War for six and thirty Moons past. 
It began upon the following Occasion. It is 
allowed on all Hands, that the primitive Way 
of breaking Eggs before we eat them, was 
upon the larger End: But his present Majes- 
ty's Grand-father, while he was a Boy, going 
to eat an Egg, and breaking it according to 
the ancient Practice, happened to cut one of 
his Fingers. Whereupon the Emperor his 
Father, published an Edict, commanding all 
his Subjects, upon great Penalties, to break 
the smaller End of their Eggs. The People so 
highly resented this Law, that our Histories 
tell us, there have been six Rebellions raised 
on that Account; wherein one Emperor lost 
his Life, and another his Crown. These civil 
Commotions were constantly fomented by the 
Monarchs of Blefuscu; and when they were 
quelled, the Exiles always fled for Refuge to 
that Empire. It is computed that eleven 
Thousand Persons have, at several Times, 
suffered Death, rather than submit to break 
their Eggs at the smaller End. Many hundred 
large Volumes have been published upon this 
Controversy: But the Books of the Big- 
Endians have been long forbidden, and the 
whole Party rendered incapable by Law of 
holding Employments. During the Course of 
these Troubles, the Emperors of Blefuscu did 
frequently expostulate by their Ambassadors, 
accusing us of making a Schism in Religion, 
by offending against a fundamental Doctrine 
of our great Prophet Lustrog, in the fifty- 
fourth Chapter of the Brundrecal, (which is 
their Alcoran.) This, however, is thought to 
be a mere Strain upon the text: For the 
Words are these; That all true Believers shall 
break their Eggs at the convenient End: and 
which is the convenient End, seems, in my 
humble Opinion, to be left to every Man's 
Conscience, or at least in the Power of the 
chief Magistrate to determine. Now the Big- 
Endian Exiles have found so much Credit in 
the Emperor of Blefuscu's Court; and so 
much private Assistance and Encouragement 
from their Party here at home, that a bloody 
War has been carried on between the two 
Empires for six and thirty Moons with various 
Success; during which Time we have lost 
Forty Capital Ships, and a much greater 
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Number of smaller Vessels, together with 
thirty thousand of our best Seamen and Sol- 
diers; and the Damage received by the 
Enemy is reckoned to be somewhat greater 
than ours. However, they have now 
equipped a numerous Fleet, and are just pre- 


paring to make a Descent upon us: and his 
Imperial Majesty, placing great Confidence in 
your Valour and Strength, hath commanded 
me to lay this Account of his Affairs before 
you. 
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Chapter 6. Optional Facilities and Instructions that are being 


Phased Out of the Architecture 


6.1 Move To Condition Register from 
KERS <4 4s oe ae ee ee 131 


The facilities and instructions described in this 
chapter are optional. An implementation may provide 
all, some, or none of them. 


Warning: These facilities and instructions are being 
phased out of the architecture. 


6.1 Move To Condition Register 
from XER 


The merxr instruction is being phased out of the archi- 
tecture. Its description is included here as an aid to 
constructing operating system code to emulate it. 


Move to Condition Register from XER 
X-form 


mcerxr BF 


31 BF | //} /// //I 512 / 
0 6 9 |11 16 21 31 


CRa,BF:4xBF+3 € XERgo-35 
XERgo-95 ¢ 0b0000 


The contents of XER3o.35 are copied to Condition Reg- 
ister field BF. XERg9.35 are set to zero. 


Special Registers Altered: 
CR field BF XER 39.35 





Programming Note 


Warning: This instruction has been phased out of 
the architecture. Attempting to execute this 


instruction will cause the system illegal instruction 
error handler to be invoked. 
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Appendix A. Suggested Floating-Point Models 


A.1 Floating-Point Round to Single-Precision Model 


The following describes algorithmically the operation of the Floating Round to Single-Precision instruction. 


If (FRB),.4; < 897 and (FRB),.63 > 0 then 
Do 
If FPSCRye = 0 then goto Disabled Exponent Underflow 
If FPSCRye = 1 then goto Enabled Exponent Underflow 
End 


If (FRB),-;; > 1150 and (FRB),.4; < 2047 then 
Do 


If FPSCRo¢ = 0 then goto Disabled Exponent Overflow 
If FPSCRog = 1 then goto Enabled Exponent Overflow 
End 


If (FRB)4.44 > 896 and (FRB),.;; < 1151 then goto Normal Operand 
If (FRB)1.¢3 = 0 then goto Zero Operand 


If (FRB),.44 = 2047 then 
Do 
If (FRB)40.63 = 0 then goto Infinity Operand 
If (FRB),> = 1 then goto QNaN Operand 
If (FRB);2 = O and (FRB)13.63 > 0 then goto SNaN Operand 
End 
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Disabled Exponent Underflow: 


sign ¢ (FRB) 
If (FRB);.44 = 0 then 
Do 
exp € — 1022 
fraco.5o ¢ 0b0 II (FRB) 49.63 
End 
If (FRB),.4; > 0 then 
Do 
exp € (FRB),.44 — 1023 
fraco.5o € Ob1 Il (FRB)40.63 
End 
Denormalize operand: 
GIl Rll X € Ob000 
Do while exp < — 126 
exp € exp + 1 
fraco.5o GIR X € Ob0 Il fracg-50 Gil (R | X) 
End 
FPSCRy x ¢ (fracog.50 IGIUR Il X) > 0 
Round Single(sign,exp,fracg-5o,G,R,X) 
FPSCRxy € FPSCRxx | FPSCR», 
If fracg.5o = O then 


Do 
FRT) € sign 
If sign = O then FPSCReppp ¢ “+ zero” 
If sign = 1 then FPSCReppp € “- zero” 
End 
If fraco.so > 0 then 
Do 
If fracg = 1 then 
Do 
If sign = O then FPSCReppp € “+ normal number” 
If sign = 1 then FPSCReppp ¢ “— normal number” 
End 
If fracg = O then 
Do 
If sign = O then FPSCReppp ¢ “+ denormalized number” 
If sign = 1 then FPSCRepap ¢ “— denormalized number” 
End 


Normalize operand: 
Do while fracg = 0 


exp € exp-1 
fraco.so ¢ fracy.50 I| ObO 
End 
FRT9 € sign 


FRT4-44 ¢ exp + 1023 
FRT j2-63 € fracy.se 
End 
Done 
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Enabled Exponent Underflow: 


FPSCRyx € 1 

sign ¢ (FRB) 

If (FRB);.44 = 0 then 
Do 


exp € — 1022 
fraCo.55 € Ob0 II (FRB)40.63 
End 
If (FRB),.4; > 0 then 
Do 


exp € (FRB),.4, — 1023 
fraco.50 € Ob1 II (FRB)40.63 
End 
Normalize operand: 
Do while fracy = 0 
exp € exp — 1 
fraco.so ¢ fracy.5o I| ObO 
End 
Round Single(sign,exp,fraco.so,0,0,0) 
FPSCRxx € FPSCRxx | FPSCRe, 
exp € exp + 192 
FRT) € sign 
FRT,.44 € exp + 1023 
FRT j2.63 € fracy.5o 
If sign = 0 then FPSCReprr + “+normal number” 
If sign = 1 then FPSCReppp € “— normal number” 
Done 


Disabled Exponent Overflow: 


FPSCRox € 1 
If FPSCRpy = Ob00 then /* Round to Nearest */ 
Do 
If (FRB)y = 0 then FRT ¢ 0x7FFO_0000_0000_0000 
If (FRB)g = 1 then FRT ¢ OxFFFO_0000_0000_0000 
If (FRB) y = O then FPSCReprpr ¢ “+ infinity” 
If (FRB)g = 1 then FPSCReprr ¢ “— infinity” 
End 
If FPSCRan = 0601 then /* Round toward Zero */ 
Do 
If (FRB)y = O then FRT ¢ 0x47EF_FFFF_E000_0000 
If (FRB)y = 1 then FRT € 0xC7EF_FFFF_E000_0000 
If (FRB)g = O then FPSCReppr ¢ “+ normal number” 
If (FRB)y = 1 then FPSCReprr € “— normal number” 
End 
If FPSCRpy = 0b10 then /* Round toward +Infinity */ 
Do 
If (FRB)y = 0 then FRT € 0x7FFO_0000_0000_0000 
If (FRB)y = 1 then FRT € 0xC7EF_FFFF_E000_0000 
If (FRB) y = O then FPSCReprpr ¢ “+ infinity” 
If (FRB)g = 1 then FPSCReppr ¢ “— normal number” 
End 
If FPSCRay = 0611 then /* Round toward — Infinity */ 
Do 
If (FRB)y = O then FRT ¢ 0x47EF_FFFF_E000_0000 
If (FRB)y = 1 then FRT ¢ OxFFFO_0000_0000_0000 
If (FRB)g = O then FPSCReppr ¢ “+ normal number” 
If (FRB)y = 1 then FPSCReprr € “— infinity” 
End 
FPSCRep € undefined 
FPSCRe ¢ 1 
FPSCRyy € 1 
Done 
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Enabled Exponent Overflow: 


sign ¢ (FRB) 
exp ¢€ (FRB),.4, — 1023 
fraco.5o € Ob1 II (FRB)10-63 


Round Single(sign,exp,fracg-5o,0,0,0) 


FPSCRxyx € FPSCRxx | FPSCR», 
Enabled Overflow: 

FPSCRox € 1 

exp € exp — 192 

FRT) € sign 

FRT 4-44 ¢€ exp + 1023 

FRT 19.63 € fracy.so 


If sign = 0 then FPSCReppr + “+normal number” 
If sign = 1 then FPSCReppp ¢ “— normal number” 


Done 


Zero Operand: 


FRT € (FRB) 

If (FRB) y = O then FPSCReppr ¢ 
If (FRB)g = 1 then FPSCReppr 
FPSCRrp fF; € 0600 

Done 


Infinity Operand: 


FRT ¢ (FRB) 

If (FRB) y = O then FPSCReppr ¢ 
If (FRB)g = 1 then FPSCReppr 
FPSCRrp fF; € 0600 

Done 


QNaN Operand: 


FRT € (FRB)o-34 II 290 
FPSCRepar ¢ “QNaN” 
FPSCRep fF ¢ 0600 
Done 


SNaN Operand: 


If FPSCRye = 0 then 
Do 
FRT9.44 ¢ (FRB)o-14 
FRTy> ¢ 1 
FRT 13:63 ¢ (FRB)13:34 |! 790 
FPSCReppe ¢ “QNaN” 
End 
FPSCRep f) € 0b00 
Done 


“+ Zero” 
“_ Zero” 


“+ infinity” 
“— infinity” 
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Normal Operand: 


sign ¢ (FRB) 

exp ¢€ (FRB),.4, — 1023 

fraco.50 € Ob1 Il (FRB)10.63 

Round Single(sign,exp,fracg-5o,0,0,0) 

FPSCRxx € FPSCRxx | FPSCRe, 

If exp > 127 and FPSCRo¢ = 0 then go to Disabled Exponent Overflow 
If exp > 127 and FPSCRo¢ = 1 then go to Enabled Overflow 
FRT) € sign 

FRT 4-44 ¢ exp + 1023 

FRT j2.63 € fracy.so 

If sign = O then FPSCReppp ¢ “+ normal number” 

If sign = 1 then FPSCReppp ¢ “— normal number” 

Done 


Round Single(sign, exp, fraco.50,G,R,X): 


inc ¢ 0 

Isb € fracog 

gbit ¢ fraco, 

rbit € fracos 

xbit € (fraCog-sollGIIRIIX)# 0 

If FPSCRry = 0b00 then /* Round to Nearest */ 

Do /* comparisons ignore u bits */ 

If sign Il Isb Il gbit Il rbit Il xbit = Obu11uu then inc ¢ 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obu011u then inc ¢ 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObuO1u1 then inc ¢ 1 


End 
If FPSCRay = 0610 then /* Round toward +Infinity */ 
Do /* comparisons ignore u bits */ 


If sign Il Isb Il gbit Il rbit Il xbit = ObOutuu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObOuutu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObOuuu1 then inc € 1 


End 
If FPSCRay = 0611 then /* Round toward — Infinity */ 
Do /* comparisons ignore u bits */ 


If sign Il Isb Il gbit Il rbit Il xbit = Ob1utuu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obituutu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obtuuul then inc ¢ 1 
End 
fraco.03 € fracg.o3 + inc 
If carry_out = 1 then 
Do 
fraco.o3 ¢€ 0b1 Il fraco..0 
exp € exp + 1 
End 
fracog-50 ¢ 299 
FPSCReR € inc 
FPSCRr, € gbit | rbit | xbit 
Return 
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A.2 Floating-Point Convert to Integer Model 


The following describes algorithmically the operation of the Floating Convert To Integer instructions. 


If Floating Convert To Integer Word then 
Do 
round_mode ¢ FPSCRrn 
tgt_precision € “32-bit integer” 
End 


If Floating Convert To Integer Word with round toward Zero then 
Do 
round_mode ¢ Ob01 
tgt_precision € “32-bit integer” 
End 


If Floating Convert To Integer Doubleword then 
Do 
round_mode ¢ FPSCRan 
tgt_precision ¢ “64-bit integer” 
End 


If Floating Convert To Integer Doubleword with round toward Zero then 
Do 
round_mode ¢ 0b01 
tgt_precision ¢ “64-bit integer” 
End 


ie ¢ (FRB)» 
If (FRB)4-44 = 2047 and (FRB)40.g¢3 = 0 then goto Infinity Operand 
If (FRB),.4; = 2047 and (FRB);> = 0 then goto SNaN Operand 
If (FRB),.;; = 2047 and (FRB),> = 1 then goto QNaN Operand 

If ( )4:11 > 1086 then goto Large Operand 


f (FRB);.44 > 0 then exp ¢ (FRB),.4,-— 1023 /* exp — bias */ 

f (FRB)1.4, = O then exp € — 1022 

f ( )4-11 > O then fraco.g4 ¢ 0601 Il (FRB);2-¢3 !1 110 /* normal; need leading 0 for later complement */ 
f (FRB),.44 = O then fraco.g4 € 0b00 II (FRB)49.63 | 110 /* denormal */ 


gbit II rbit Il xbit € Ob000 

Do i=1,63-—exp /* do the loop 0 times if exp = 63 */ 
fraco.g4 Il gbit II rbit Il xbit € ObO II frac 

End 


Round Integer(sign,fraco.g4,gbit, rbit,xbit,round_mode) 

If sign = 1 then fraco.g4 ¢ - fracog4+ 1 /* needed leading 0 for — 26+ < (FRB) < — 26 */ 
If tgt_precision = “32-bit integer” and fraco.g4 > 23'- 1 then goto Large Operand 

If tgt_precision = “64-bit integer” and fraco.g4 > 2°31 then goto Large Operand 


If tgt_precision = “32-bit integer” and fracg.g4 < — 2°! then goto Large Operand 
If tgt_precision = “64-bit integer” and fraco.g4 < — 26 then goto Large Operand 


FPSCRyy € FPSCRyy | FPSCRE 


If tgt_precision = “32-bit integer” then FRT € Oxuuuu_uuuu II frac33.64 /* u is undefined hex digit */ 
If tgt_precision = “64-bit integer” then FRT € fracy.g4 

FPSCRrppr ¢ undefined 

Done 
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Round Integer(sign, fraco.g4,gbit, rbit, xbit, round_mode): 


inc ¢€ 0 
If round_mode = 0b00 then /* Round to Nearest */ 
Do /* comparisons ignore u bits */ 


If sign Il fracg, Il gbit Il rbit || xbit = Obuittuu then inc ¢ 1 
If sign Il fracg, Il gbit Il rbit I| xbit = Obu011u then inc ¢ 1 
If sign Il fracg, Il gbit Il rbit || xbit = ObuO1u1 then inc ¢ 1 


End 
If round_mode = 0b10 then /* Round toward +Infinity */ 
Do /* comparisons ignore u bits */ 


If sign Il fracg, Il gbit Il rbit || xbit = ObOutuu then inc ¢ 1 
If sign Il fracg, Il gbit Il rbit || xbit = ObOuutu then inc ¢ 1 
If sign Il fracg, Il gbit II rbit || xbit = ObOuuul then inc ¢ 1 


End 
If round_mode = 0b11 then /* Round toward — Infinity */ 
Do /* comparisons ignore u bits */ 


If sign Il fracg, Il gbit Il rbit || xbit = Ob1utuu then inc ¢ 1 
If sign Il fracg, Il gbit Il rbit || xbit = Ob1uutu then inc ¢ 1 
If sign Il fracg, Il gbit Il rbit || xbit = Ob1uuul then inc € 1 
End 

fraco.g4 € fraco.g4, + inc 

FPSCReR € inc 

FPSCRr, € gbit | rbit | xbit 

Return 


Infinity Operand: 


FPSCRer Fi vxcvi € 06001 
If FPSCRye = 0 then Do 
If tgt_precision = “32-bit integer” then 
Do 
If sign = O then FRT ¢ Oxuuuu_uuuu_7FFF_FFFF /* u is undefined hex digit */ 
If sign = 1 then FRT € Oxuuuu_uuuu_8000_0000 /* u is undefined hex digit */ 
End 
Else 
Do 
If sign = O then FRT ¢ Ox7FFF_FFFF_FFFF_FFFF 
If sign = 1 then FRT ¢ 0x8000_0000_0000_0000 
End 
FPSCReprr ¢ undefined 
End 
Done 


SNaN Operand: 


FPSCRer Fi vxsnan vxcvi € 00011 
If FPSCRye = 0 then 
Do 
If tgt_precision = “32-bit integer” then FRT € Oxuuuu_uuuu_8000_0000 /* u is undefined hex digit */ 
If tgt_precision = “64-bit integer” then FRT ¢ 0x8000_0000_0000_0000 
FPSCReppr ¢ undefined 
End 
Done 
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QNaN Operand: 


FPSCRer Fi vxcvi € 06001 
If FPSCRye = 0 then 
Do 
If tgt_precision = “32-bit integer” then FRT € Oxuuuu_uuuu_8000_0000 /* u is undefined hex digit */ 
If tgt_precision = “64-bit integer” then FRT € 0x8000_0000_0000_0000 
FPSCReppr ¢ undefined 
End 
Done 


Large Operand: 


FPSCRer Fi vxcvi € 06001 
If FPSCRye = 0 then Do 
If tgt_precision = “32-bit integer” then 
Do 
If sign = Othen FRT € Oxuuuu_uuuu_7FFF_FFFF /* u is undefined hex digit */ 
If sign = 1 then FRT € Oxuuuu_uuuu_8000_0000 /* u is undefined hex digit */ 
End 
Else 
Do 
If sign = O then FRT ¢ Ox7FFF_FFFF_FFFF_FFFF 
If sign = 1 then FRT € 0x8000_0000_0000_0000 
End 
FPSCReprr ¢ undefined 
End 


Done 
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A.3 Floating-Point Convert from Integer Model 


The following describes algorithmically the operation of the Floating Convert From Integer Doubleword instruction. 


sign ¢ (FRB) 
exp ¢ 63 
fraco.g3 ¢ (FRB) 


If fraco.g3 = O then go to Zero Operand 
If sign = 1 then fraco.g3 ¢ = fracg.g3 + 1 


Do while fracg = 0 /* do the loop O times if (FRB) = maximum negative integer */ 
fraco.g3 ¢ fracy.¢3 || ObO 
exp € exp — 1 

End 


Round Float(sign,exp, fraco.g3,F PSCRan) 


If sign = 0 then FPSCReppr + “+normal number” 
If sign = 1 then FPSCReppp ¢ “— normal number” 
FRT) € sign 

FRT,.44 € exp + 1023 /* exp + bias */ 

FRT j2.63 € fracy.s 

Done 


Zero Operand: 
FPSCRep Ff) ¢ Ob00 
FPSCRrpprr € “+zero” 


FRT € 0x0000 0000 0000 0000 
Done 
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Round Float(sign, exp, fraco.g3,round_mode): 


inc € 0 

Isb € fracso 

gbit ¢ fracs, 

rbit € fracs,4 

xbit € fracss5.93 > 0 

If round_mode = 0b00 then /* Round to Nearest */ 

Do /* comparisons ignore u bits */ 

If sign Il Isb Il gbit Il rbit Il xbit = Obui1tuu then inc ¢ 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obu011u then inc ¢ 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObuO1u1 then inc ¢ 1 


End 
If round_mode = 0b10 then /* Round toward +Infinity */ 
Do /* comparisons ignore u bits */ 


If sign Il Isb Il gbit Il rbit Il xbit = ObOutuu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObOuutu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = ObOuuui1 then inc € 1 


End 
If round_mode = 0b11 then /* Round toward — Infinity */ 
Do /* comparisons ignore u bits */ 
If sign Il Isb Il gbit Il rbit Il xbit = Obtutuu then inc ¢ 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obtuutu then inc € 1 
If sign Il Isb Il gbit Il rbit Il xbit = Obtuuul then inc + 1 
End 


fraco.so ¢ fraco.s55 + inc 

If carry_out = 1 then exp ¢ exp + 1 
FPSCReR € inc 

FPSCRr, € gbit | rbit | xbit 

FPSCRxyx € FPSCRxx | FPSCR», 
Return 
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Appendix B. Assembler Extended Mnemonics 


In order to make assembler language programs simpler to write and easier to understand, a set of extended 
mnemonics and symbols is provided that defines simple shorthand for the most frequently used forms of Branch 
Conditional, Compare, Trap, Rotate and Shift, and certain other instructions. 


Assemblers should provide the extended mnemonics and symbols listed here, and may provide others. 


B.1 Symbols 


The following symbols are defined for use in instructions (basic or extended mnemonics) that specify a Condition 
Register field or a Condition Register bit. The first five (It, .., un) identify a bit number within a CR field. The 
remainder (cr0, ..., cr7) identify a CR field. An expression in which a CR field symbol is multiplied by 4 and then 
added to a bit-number-within-CR-field symbol can be used to identify a CR bit. 


Symbol Value Meaning 


It 0 Less than 

gt 1 Greater than 

eq 2 Equal 

so 3 Summary overflow 
un 3 Unordered (after floating-point comparison) 
cr0 0 CR Field 0 

cri 1 CR Field 1 

cr2 2 ~~ CR Field 2 

cr3 3 CR Field 3 

cr4 4 CR Field 4 

cr5 5 CR Field 5 

cr6 6 CR Field 6 

cr7 7 ~~ CR Field 7 


The extended mnemonics in Sections B.2.2 and B.3 require identification of a CR bit: if one of the CR field symbols 
is used, it must be multiplied by 4 and added to a bit-number-within-CR-field (value in the range 0-3, explicit or 
symbolic). The extended mnemonics in Sections B.2.3 and B.5 require identification of a CR field: if one of the CR 
field symbols is used, it must not be multiplied by 4. (For the extended mnemonics in Section B.2.3, the bit 
number within the CR field is part of the extended mnemonic. The programmer identifies the CR field, and the 
Assembler does the multiplication and addition required to produce a CR bit number for the BI field of the under- 
lying basic mnemonic.) 
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B.2 Branch Mnemonics 


The mnemonics discussed in this section are variations of the Branch Conditional instructions. 


Note: belr, beirl, bectr, and bectrl each serve as both a basic and an extended mnemonic. The Assembler will 
recognize a belr, belirl, bectr, or bectrl mnemonic with three operands as the basic form, and a belr, belrl, bectr, or 
bectrl mnemonic with two operands as the extended form. In the extended form the BH operand is omitted and 
assumed to be 0b00. Similarly, for all the extended mnemonics described in Sections B.2.2 - B.2.4 that devolve to 
any of these four basic mnemonics the BH operand can either be coded or omitted. If it is omitted it is assumed 
to be 0b00. 


B.2.1 BO and BI Fields 


The 5-bit BO and BI fields control whether the branch is taken. Providing an extended mnemonic for every pos- 
sible combination of these fields would be neither useful nor practical. The mnemonics described in Sections 
B.2.2 - B.2.4 include the most useful cases. Other cases can be coded using a basic Branch Conditional mne- 
monic (be[![a], belr[f, bectr[f) with the appropriate operands. 


B.2.2 Simple Branch Mnemonics 


Instructions using one of the mnemonics in Table 3 that tests a Condition Register bit specify the corresponding 
bit as the first operand. The symbols defined in Section B.1 can be used in this operand. 


Notice that there are no extended mnemonics for relative and absolute unconditional branches. For these the 
basic mnemonics b, ba, bl, and bla should be used. 





























Table 3. Simple branch mnemonics 
LR not Set LR Set 
Branch Semantics be bea belr bectr bel bela belrl bectrl 
Relative | Absolute | To LR | To CTR | Relative | Absolute | To LR | To CTR 
Branch unconditionally = = blr betr - = blrl betrl 
Branch if CRp)= 1 bt bta btlr btctr btl btla btlrl btctrl 
Branch if CRp)=0 bf f bflrl bfctrl 
Decrement CTR, 
branche ra nongere bdnz bdnza bdnzlir - bdnzl bdnzla | bdnzirl - 
Decrement CTR, 
branch if CTR nonzero bdnzt bdnzta | bdnztlr - bdnztl bdnztla | bdnztlrl - 
and CRp)= 1 
Decrement CTR, 
branch if CTR nonzero bdnzf bdnzfa | bdnzflr - bdnzfl bdnzfla | bdnzflrl - 
and CRp= 0 
Decrement CTR, bdzirl _ 
branch if CTR zero 
Decrement CTR, 
branch if CTR zero bdzt bdzta bdztlr - bdztl bdztla bdztlrl - 
and CRp)= 1 
Decrement CTR, 
branch if CTR zero bdzf bdzfa bdzflr - bdzfl bdzfla bdzflrl - 
and CRp)= 0 
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Examples 


1. Decrement CTR and branch if it is still nonzero (closure of a loop controlled by a count loaded into CTR). 


bdnz_ target (equivalent to: bc 16,0,target) 
2. Same as (1) but branch only if CTR is nonzero and condition in CRO is “equal”. 

bdnzt eq,target (equivalent to: bc 8,2,target) 
3. Same as (2), but “equal” condition is in CRS. 

bdnzt 4*cr5+eq,target (equivalent to: bc 8,22,target) 
4. Branch if bit 27 of CR is 0. 

bf 27,target (equivalent to: bc 4,27,target) 


5. Same as (4), but set the Link Register. This is a form of conditional “call”. 


bfl 27,target (equivalent to: bel 4,27,target) 


B.2.3 Branch Mnemonics Incorporating Conditions 


In the mnemonics defined in Table 4 on page 146, the test of a bit in a Condition Register field is encoded in the 


mnemonic. 


Instructions using the mnemonics in Table 4 specify the Condition Register field as an optional first operand. 


One 


of the CR field symbols defined in Section B.1 can be used for this operand. If the CR field being tested is CR 
Field 0, this operand need not be specified unless the resulting basic mnemonic is belr[f| or bectr[f and the BH 


operand is specified. 
A standard set of codes has been adopted for the most common combinations of branch conditions. 


Code Meaning 


It Less than 
le Less than or equal 
eq Equal 


ge Greater than or equal 

gt Greater than 

nl Not less than 

ne Not equal 

ng Not greater than 

so Summary overflow 

ns Not summary overflow 

un Unordered (after floating-point comparison) 

nu Not unordered (after floating-point comparison) 


These codes are reflected in the mnemonics shown in Table 4. 
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Table 4. Branch mnemonics incorporating conditions 
































LR not Set LR Set 

Branch Semantics be belr bectr belrl | bectri 
Relative | Absolute | To LR | To CTR | Relative | Absolute | To LR | To CTR 

Branch if less than bit bita bltlr bltctr bitl bitla bitirl bltctrl 
Branch if less than or equal ble blea blelr blectr blel blela blelrl blectrl 
Branch if equal beq beqa bealr beqctr beql beqla bealrl | beqctrl 
Branch if greater than or equal bge bgelr bgelrl | bgectrl 
Branch if greater than bgt bgta bgtlr bgtctr bgtl bgtla bgilrl bgtctrl 
Branch if not less than bnl bnla bnilr bnictr bnil bnila bnilrl bnictrl 
Branch if not equal bne bnea bnelr bnectr bnel bnela bnelrl | bnectrl 
Branch if summary overflow bso bsoa bsolr bsoctr bsol bsola bsolrl | bsoctrl 
Branch if not summary overflow bns bnsa bnslr bnsctr bnsl bnsla bnslrl | bnsctrl 
Branch if unordered bun buna bunlr bunctr bunl bunla bunirl | bunctrl 

















Branch if not unordered bnu bnulr bnuctr bnulrl | bnuctrl 


Examples 


1. Branch if CRO reflects condition “not equal’. 

bne _ target (equivalent to: bc 4,2,target) 
2. Same as (1), but condition is in CR3. 

bne__cr3,target (equivalent to: bc 4,14,target) 


3. Branch to an absolute target if CR4 specifies “greater than”, setting the Link Register. This is a form of 
conditional “call”. 


bgtla cr4,target (equivalent to: bcla 12,17,target) 
4. Same as (3), but target address is in the Count Register. 


bgtctrl cr4 (equivalent to: bectrl 12,17,0) 


B.2.4 Branch Prediction 


Software can use the “at” bits of Branch Conditional instructions to provide a hint to the processor about the 
behavior of the branch. If, for a given such instruction, the branch is almost always taken or almost always not 
taken, a suffix can be added to the mnemonic indicating the value to be used for the “at” bits. 


+ Predict branch to be taken (at=0b11) 
— Predict branch not to be taken (at=0b10) 


Such a suffix can be added to any Branch Conditional mnemonic, either basic or extended, that tests either the 
Count Register or a CR bit (but not both). Assemblers should use 0b00 as the default value for the “at” bits, 
indicating that software has offered no prediction. 


Examples 


1. Branch if CRO reflects condition “less than”, specifying that the branch should be predicted to be taken. 
blt+ target 

2. Same as (1), but target address is in the Link Register and the branch should be predicted not to be taken. 
bltlr— 
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B.3 Condition Register Logical Mnemonics 


The Condition Register Logical instructions can be used to set (to 1), clear (to 0), copy, or invert a given Condition 
Register bit. Extended mnemonics are provided that allow these operations to be coded easily. 





Table 5. Condition Register logical mnemonics 


Operation Extended Mnemonic Equivalent to 











Condition Register set crset bx creqv bx,bx,bx 
Condition Register clear crelr bx crxor bx,bx,bx 
Condition Register move crmove bx,by cror bx,by,by 











Condition Register not crnot bx,by crnor bx,by,by 
The symbols defined in Section B.1 can be used to identify the Condition Register bits. 


Examples 


1. Set CR bit 25. 


crset 25 (equivalent to: creqv 25,25,25) 
2. Clear the SO bit of CRO. 
crclr so (equivalent to: crxor 3,3,3) 
3. Same as (2), but SO bit to be cleared is in CR3. 
crclr 4*cr3+so (equivalent to: crxor 15,15,15) 
4. Invert the EQ bit. 
crnot eq,eq (equivalent to: crnor 22,2) 
5. Same as (4), but EQ bit to be inverted is in CR4, and the result is to be placed into the EQ bit of CRS5. 
crnot 4*cr5+eq,4*cr4+eq (equivalent to: crnor 22,18,18) 


B.4 Subtract Mnemonics 


B.4.1 Subtract Immediate 


Although there is no “Subtract Immediate” instruction, its effect can be achieved by using an Add Immediate 
instruction with the immediate operand negated. Extended mnemonics are provided that include this negation, 
making the intent of the computation clearer. 


subi Rx,Ry,value (equivalent to: addi Rx,Ry,— value) 
subis Rx,Ry,value (equivalent to: addis Rx,Ry,— value) 
subic Rx,Ry,value (equivalent to: addic Rx,Ry,— value) 
subic. Rx,Ry,value (equivalent to: addic. Rx,Ry,— value) 
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B.4.2 Subtract 


The Subtract From instructions subtract the second operand (RA) from the third (RB). Extended mnemonics are 
provided that use the more “normal” order, in which the third operand is subtracted from the second. Both these 
mnemonics can be coded with a final “o” and/or “.” to cause the OE and/or Rc bit to be set in the underlying 
instruction. 


sub Rx,Ry,Rz (equivalent to: subf Rx,Rz,Ry) 
subc Rx,Ry,Rz (equivalent to: subfc Rx,Rz,Ry) 


B.5 Compare Mnemonics 


The L field in the fixed-point Compare instructions controls whether the operands are treated as 64-bit quantities 
or as 32-bit quantities. Extended mnemonics are provided that represent the L value in the mnemonic rather than 
requiring it to be coded as a numeric operand. 


The BF field can be omitted if the result of the comparison is to be placed into CR Field 0. Otherwise the target 
CR field must be specified as the first operand. One of the CR field symbols defined in Section B.1 can be used 
for this operand. 


Note: The basic Compare mnemonics of PowerPC are the same as those of POWER, but the POWER instructions 
have three operands while the PowerPC instructions have four. The Assembler will recognize a basic Compare 
mnemonic with three operands as the POWER form, and will generate the instruction with L=0. (Thus the Assem- 
bler must require that the BF field, which normally can be omitted when CR Field 0 is the target, be specified 
explicitly if L is.) 
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B.5.1 Doubleword Comparisons 


Table 6. Doubleword compare mnemonics 
Operation 


Extended Mnemonic 


Equivalent to 





Compare doubleword immediate 


cmpdi bf,ra,si 


cmpi bf,1,ra,si 





Compare doubleword 


Compare logical doubleword immediate cmpldi_ bf,ra,ui 


Compare logical doubleword 


cmpd bf,ra,rb 








cmpld bf,ra,rb 


cmp bf,1,ra,rb 
cmpli bf,1,ra,ui 


cmpl bf,1,ra,rb 








Examples 


1. Compare register Rx and immediate value 100 as unsigned 64-bit integers and place result into CRO. 
0,1,Rx,100) 


cmpldi Rx,100 
2. Same as (1), but place result into CR4. 
cmpldi cr4,Rx,100 


(equivalent to: cmpili 


(equivalent to: cmpili 


3. Compare registers Rx and Ry as signed 64-bit integers and place result into CRO. 


cmpd_Rx,Ry 


B.5.2 Word Comparisons 


(equivalent to: cmp 


4,1,Rx,100) 


0,1,Rx,Ry) 





Table 7. Word compare mnemonics 





Operation 


Extended Mnemonic 





Compare word immediate cmpwi bf,ra,si 


Compare word 


cmpw bf,ra,rb 


Equivalent to 
cmpi bf,0,ra,si 


cmp bf,0,ra,rb 





Compare logical word immediate 


cmplwi bf,ra,ui 


cmpli bf,0,ra,ui 





Compare logical word 








cmplw bf,ra,rb 


cmpl bf,0,ra,rb 





Examples 
1. Compare bits 32:63 of register Rx and immediate value 100 as signed 32-bit integers and place result into 
CRO. 
cmpwi Rx,100 (equivalent to: cmpi __0,0,Rx,100) 
2. Same as (1), but place result into CR4. 
cmpwi_ cr4,Rx,100 (equivalent to: cmpi 4,0,Rx,100) 
3. Compare bits 32:63 of registers Rx and Ry as unsigned 32-bit integers and place result into CRO. 
cmplw_ Rx,Ry (equivalent to: cmpl 0,0,Rx,Ry) 
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B.6 Trap Mnemonics 


The mnemonics defined in Table 8 are variations of the Trap instructions, with the most useful values of TO 
represented in the mnemonic rather than specified as a numeric operand. 


A standard set of codes has been adopted for the most common combinations of trap conditions. 


Code Meaning TO encoding <>=2 cs 
It Less than 16 10000 
le Less than or equal 20 10100 
eq Equal 4 00100 
ge Greater than or equal 12 01100 
gt Greater than 8 01000 
nl Not less than 12 01100 
ne Not equal 24 11000 
ng Not greater than 20 10100 
IIt Logically less than 2 00010 
lle Logically less than or equal 6 00110 
Ige Logically greater than or equal 5 00101 
Igt Logically greater than 1 00001 
Inl Logically not less than 5 00101 
Ing Logically not greater than 6 00110 
u Unconditionally with parameters 31 111141 
(none) Unconditional 31 111141 


These codes are reflected in the mnemonics shown in Table 8. 













Table 8. Trap mnemonics 





64-bit Comparison 


tdi td 
Immediate Register 


32-bit Comparison 








Trap Semantics tw 


Register 


twi 
Immediate 

















Trap unconditionally 





Trap unconditionally with parameters twu 





Trap if less than 


Trap if equal 

































Trap if greater than or equal twge 
Trap if greater than tdgti tdgt twgti twgt 
Trap if not equal tdnei tdne twnei twne 
Trap if not greater than tdngi tdng twngi twng 
Trap if logically less than tdllti tdllt twllti twllt 
Trap if logically greater than or equal tdlgei tdlge twlgei twlge 
Trap if logically greater than tdlgti tdigt twlgti twlgt 
Trap if logically not less than tdInii tdin| twinli twinl 


Examples 


1. Trap if register Rx is not 0. 
tdnei Rx,0 (equivalent to: tdi 24,Rx,0) 
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2. Same as (1), but comparison is to register Ry. 


tdne Rx,Ry (equivalent to: td 24,Rx,Ry) 
3. Trap if bits 32:63 of register Rx, considered as a 32-bit quantity, are logically greater than Ox7FF. 
twlgti Rx,0x7FF (equivalent to: twi 1,Rx,0x7FF) 


4. Trap unconditionally. 
trap (equivalent to: tw 31,0,0) 
5. Trap unconditionally with immediate parameters Rx and Ry 
tdu —— Rx, Ry (equivalent to: td 31,Rx,Ry) 


B.7 Rotate and Shift Mnemonics 


The Rotate and Shift instructions provide powerful and general ways to manipulate register contents, but can be 
difficult to understand. Extended mnemonics are provided that allow some of the simpler operations to be coded 
easily. 


Mnemonics are provided for the following types of operation. 


Extract Select a field of n bits starting at bit position b in the source register; left or right justify this field in 
the target register; clear all other bits of the target register to 0. 


Insert Select a left-justified or right-justified field of n bits in the source register; insert this field starting at 
bit position b of the target register; leave other bits of the target register unchanged. (No extended 
mnemonic is provided for insertion of a left-justified field when operating on doublewords, because 
such an insertion requires more than one instruction.) 


Rotate Rotate the contents of a register right or left n bits without masking. 
Shift Shift the contents of a register right or left n bits, clearing vacated bits to 0 (logical shift). 
Clear Clear the leftmost or rightmost n bits of a register to 0. 


Clear left and shift left 
Clear the leftmost b bits of a register, then shift the register left by n bits. This operation can be used 
to scale a (known nonnegative) array index by the width of an element. 


B.7.1 Operations on Doublewords 


All these mnemonics can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. 





Table 9. Doubleword rotate and shift mnemonics 
































Operation Extended Mnemonic Equivalent to 

Extract and left justify immediate extldi ra,rs,n,o (n> 0) ridicr ra,rs,b,n— 1 
Extract and right justify immediate extrdi ra,rs,n,o (n> 0) ridicl ra,rs,o+n,64— n 
Insert from right immediate insrdi ra,rs,n,o (n> 0) ridimi ra,rs,64— (b+n),b 
Rotate left immediate rotidi ra,rs,n ridicl ra,rs,n,O 

Rotate right immediate rotrdi ra,rs,n ridicl ra,rs,64— n,0 
Rotate left rotid ra,rs,rb ridcl ra,rs,rb,0 

Shift left immediate sildi ra,rs,n (n< 64) ridicr ra,rs,n,63-— n 
Shift right immediate srdi ra,rsyn_ (n< 64) ridicl ra,rs,64— n,n 
Clear left immediate clrldi ra,rs,n_ (n < 64) ridicl ra,rs,0,n 

Clear right immediate clrrdi ra,rs,n (n< 64) ridicr ra,rs,0,63— n 
Clear left and shift left immediate clrisiIdi ra,rs,o,n (n< b< 64) ridic ra,rs,n,b- n 
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Examples 


1. Extract the sign bit (bit 0) of register Ry and place the result right-justified into register Rx. 


extrdi Rx,Ry,1,0 (equivalent to: ridicl Rx,Ry,1,63) 
2. Insert the bit extracted in (1) into the sign bit (bit 0) of register Rz. 

insrdi Rz,Rx,1,0 (equivalent to: rildimi Rz,Rx,63,0) 
3. Shift the contents of register Rx left 8 bits. 

sldi Rx,Rx,8 (equivalent to: rildicr Rx,Rx,8,55) 


4. Clear the high-order 32 bits of register Ry and place the result into register Rx. 
clrldi Rx,Ry,32 (equivalent to: rldicl Rx,Ry,0,32) 


B.7.2 Operations on Words 


All these mnemonics can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. The 
operations as described above apply to the low-order 32 bits of the registers, as if the registers were 32-bit regis- 
ters. The Insert operations either preserve the high-order 32 bits of the target register or place rotated data 
there; the other operations clear these bits. 












































Table 10. Word rotate and shift mnemonics 
Operation Extended Mnemonic Equivalent to 
Extract and left justify immediate extlwi ra,rs,n,o (n> 0) rlwinm ra,rs,b,0,n— 1 
Extract and right justify immediate extrwi ra,rs,n,o (n> 0) rlwinm ra,rs,b+n,32- n,31 
Insert from left immediate inslwi ra,rs,n,o (n> 0) rlwimi ra,rs,32— b,b,(b+n)— 1 
Insert from right immediate insrwi ra,rs,n,o (n> 0) rlwimi ra,rs,32— (b+n),b,(b+n)— 1 
Rotate left immediate rotlwi ra,rs,n rlwinm ra,rs,n,0,31 
Rotate right immediate rotrwi ra,rs,n rlwinm ra,rs,32— n,0,31 
Rotate left rotlw ra,rs,rb rlwnm ra,rs,rb,0,31 
Shift left immediate slwi ra,rs,n_ (n < 32) rlwinm ra,rs,n,0,31— n 
Shift right immediate srwi ra,rs,n (n< 32) rlwinm ra,rs,32— n,n,31 
Clear left immediate clrlwi ra,rs,n_  (n < 32) rlwinm ra,rs,0,n,31 
Clear right immediate clrrwi ra,rs,n (n< 32) rlwinm ra,rs,0,0,31— n 
Clear left and shift left immediate clrislwi ra,rs,o,n_ (n< b < 32) rlwinm ra,rs,n,o-— n,g1i-— n 
Examples 


1. Extract the sign bit (bit 32) of register Ry and place the result right-justified into register Rx. 

extrwi Rx,Ry,1,0 (equivalent to: rlwinm Rx,Ry,1,31,31) 
2. Insert the bit extracted in (1) into the sign bit (bit 32) of register Rz. 

insrwi Rz,Rx,1,0 (equivalent to: rlwimi Rz,Rx,31,0,0) 
3. Shift the contents of register Rx left 8 bits, clearing the high-order 32 bits. 

slwi Rx,Rx,8 (equivalent to: rlwinm Rx,Rx,8,0,23) 


4. Clear the high-order 16 bits of the low-order 32 bits of register Ry and place the result into register Rx, 
clearing the high-order 32 bits of register Rx. 


clrlwi Rx,Ry,16 (equivalent to: rlwinm Rx,Ry,0,16,31) 
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B.8 Move To/From Special Purpose Register Mnemonics 


The mitspr and mfspr instructions specify a Special Purpose Register (SPR) as a numeric operand. Extended mne- 
monics are provided that represent the SPR in the mnemonic rather than requiring it to be coded as an operand. 





Table 11. Extended mnemonics for moving to/from an SPR 


; Move To SPR Move From SPR 
Special Purpose Register 


Extended Equivalent to Extended Equivalent to 





Fixed-Point Exception Register (XER) mtxer Rx mtspr 1,Rx mfxer Rx mfspr Rx,1 





Link Register (LR) mtlr Rx mtspr 8,Rx mflr Rx mfspr Rx,8 


Count Register (CTR) mtctr Rx mtspr 9,Rx mfctr Rx mfspr Rx,9 














Examples 


1. Copy the contents of register Rx to the XER. 

mtxer Rx (equivalent to: mtspr 1,Rx) 
2. Copy the contents of the LR to register Rx. 

mflr Rx (equivalent to: mfspr Rx,8) 
3. Copy the contents of register Rx to the CTR. 


mtctr Rx (equivalent to: mtspr 9,Rx) 


B.9 Miscellaneous Mnemonics 


No-op 

Many PowerPC instructions can be coded in a way such that, effectively, no operation is performed. An extended 
mnemonic is provided for the preferred form of no-op. If an implementation performs any type of run-time opti- 
mization related to no-ops, the preferred form is the no-op that will trigger this. 


nop (equivalent to: ori 0,0,0) 
Load Immediate 
The addi and addis instructions can be used to load an immediate value into a register. Extended mnemonics are 
provided to convey the idea that no addition is being performed but merely data movement (from the immediate 
field of the instruction to a register). 
Load a 16-bit signed immediate value into register Rx. 

li Rx,value (equivalent to: addi Rx,0,value) 


Load a 16-bit signed immediate value, shifted left by 16 bits, into register Rx. 


lis Rx,value (equivalent to: addis Rx,0,value) 
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Load Address 


This mnemonic permits computing the value of a base-displacement operand, using the addi instruction which 
normally requires separate register and immediate operands. 


la Rx,D(Ry) (equivalent to: addi_ Rx,Ry,D) 
The la mnemonic is useful for obtaining the address of a variable specified by name, allowing the Assembler to 
supply the base register number and compute the displacement. If the variable v is located at offset Dv bytes 
from the address in register Rv, and the Assembler has been told to use register Rv as a base for references to 
the data structure containing v, then the following line causes the address of v to be loaded into register Rx. 


la Rx,v (equivalent to: addi _ Rx,Rv,Dv) 


Move Register 

Several PowerPC instructions can be coded in a way such that they simply copy the contents of one register to 
another. An extended mnemonic is provided to convey the idea that no computation is being performed but 
merely data movement (from one register to another). 


The following instruction copies the contents of register Ry to register Rx. This mnemonic can be coded with a 
final “.” to cause the Rc bit to be set in the underlying instruction. 


mr Rx,Ry (equivalent to: or Rx,Ry,Ry) 


Complement Register 

Several PowerPC instructions can be coded in a way such that they complement the contents of one register and 
place the result into another register. An extended mnemonic is provided that allows this operation to be coded 
easily. 


The following instruction complements the contents of register Ry and places the result into register Rx. This 
mnemonic can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. 


not Rx,Ry (equivalent to: nor  Rx,Ry,Ry) 


Move To/From Condition Register 


This mnemonic permits copying the contents of the low-order 32 bits of a GPR to the Condition Register, using the 
same style as the mfer instruction. 


mtcr Rx (equivalent to: mtcrf OxFF,Rx) 


The following instructions may generate either the (old) mterf or mfcr instructions or the (new) mtocrf or mfocrf 
instruction, respectively, depending on the target machine type assembler parameter. 


mtcrf FXM,Rx 
mfcr Rx 


All three extended mnemonics in this subsection are being phased out. In future assemblers the form "mtcr Rx" 
may not exist, and the mtcrf and mfer mnemonics may generate the old form instructions (with bit 11 = 0) 
regardless of the target machine type assembler parameter, or may cease to exist. 
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Appendix C. Programming Examples 


C.1 Multiple-Precision Shifts 


This section gives examples of how multiple-precision 
shifts can be programmed. 


A multiple-precision shift is defined to be a shift of an 
N-doubleword quantity (64-bit mode) or an N-word 
quantity (32-bit mode), where N>1. The quantity to be 
shifted is contained in N registers. The shift amount 
is specified either by an immediate value in the 
instruction or by a value in a register. 


The examples shown below distinguish between the 
cases N=2 and N>2. If N=2, the shift amount may be 
in the range 0 through 127 (64-bit mode) or 0 through 
63 (32-bit mode), which are the maximum ranges sup- 
ported by the Shift instructions used. However if 
N>2, the shift amount must be in the range 0 through 
63 (64-bit mode) or O through 31 (32-bit mode), in 
order for the examples to yield the desired result. 
The specific instance shown for N>2 is N=3; 
extending those code sequences to larger N is 
straightforward, as is reducing them to the case N=2 


when the more stringent restriction on shift amount is 
met. For shifts with immediate shift amounts only the 
case N=3 is shown, because the more stringent 
restriction on shift amount is always met. 


In the examples it is assumed that GPRs 2 and 3 (and 
4) contain the quantity to be shifted, and that the 
result is to be placed into the same registers, except 
for the immediate left shifts in 64-bit mode for which 
the result is placed into GPRs 3, 4, and 5. In all 
cases, for both input and result, the lowest-numbered 
register contains the highest-order part of the data 
and highest-numbered register contains the lowest- 
order part. For non-immediate shifts, the shift 
amount is assumed to be in GPR 6. For immediate 
shifts, the shift amount is assumed to be greater than 
0. GPRs 0 and 31 are used as scratch registers. 


For N>2, the number of instructions required is 2N—1 
(immediate shifts) or 3N— 1 (non-immediate shifts). 
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Multiple-precision shifts in 64-bit mode 


Shift Left Immediate, N = 3 (shift amnt < 64) 


rldicr r5,r4,sh,63-sh 
rldimi r4,r3,0,sh 
ridicl r4,r4,sh,0 
rildimi r3,r2,0,sh 
ridicl r3,r3,sh,0 

Shift Left, N= 2 (shift amnt < 128) 
subfic r31,1r6,64 
sid r2,r2,r6 
srd r0,r3,r31 
or r2,r2,r0 
addi 131,1r6,-64 
sid r0,r3,r31 
or r2,r2,r0 
sid r3,r3,r6 

Shift Left, N = 3 (shift amnt < 64) 
subfic r31,1r6,64 
sid r2,r2,r6 
srd r0,r3,r31 
or r2,r2,r0 
sid r3,r3,r6 
srd r0,r4,r31 
or r3,r3,r0 
sid r4,r4,r6 

Shift Right Immediate, N = 3 (shift amnt < 64) 
rildimi r4,r3,0,64-sh 
ridicl r4,r4,64-sh,0 
rildimi r3,r2,0,64-sh 
ridicl r3,r3,64-sh,0 
ridicl r2,r2,64-sh,sh 


Shift Right, N = 2 (shift amnt < 128) 


subfic 
srd 
sid 

or 
addi 
srd 

or 

srd 


r31,1r6,64 
r3,r3,r6 
r0,r2,r31 
r3,r3,r0 
r31,1r6,-64 
r0,r2,r31 
r3,r3,r0 
r2,r2,r6 


Shift Right, N = 3 (shift amnt < 64) 


subfic 
srd 
sid 

or 

srd 
sid 

or 

srd 


r31,16,64 
r4,r4,r6 
r0,r3,r31 
r4,r4,r0 
r3,r3,r6 
r0,r2,r31 
r3,r3,r0 
r2,r2,r6 


Multiple-precision shifts in 32-bit mode 


Shift Left Immediate, N = 3 (shift amnt < 32) 


rlwinm 
rlwimi 
rlwinm 
rlwimi 
rlwinm 


r2,r2,sh,0,31-sh 
r2,r3,sh,32-sh,31 
r3,r3,sh,0,31-sh 
r3,r4,sh,32-sh,31 
r4,r4,sh,0,31-sh 


Shift Left, N = 2 (shift amnt < 64) 


subfic 
slw 
srw 
or 
addi 
slw 

or 

slw 


r31,16,32 
r2,r2,r6 
r0,r3,r31 
r2,r2,r0 
r31,16,-32 
r0,r3,r31 
r2,r2,r0 
r3,r3,r6 


Shift Left, N = 3 (shift amnt < 32) 


subfic 
slw 
srw 
or 

slw 
srw 
or 

slw 


131,16,32 
r2,r2,r6 
r0,r3,r31 
r2,r2,r0 
r3,r3,r6 
r0,r4,r31 
r3,r3,r0 
r4,r4,r6 


Shift Right Immediate, N = 3 (shift amnt < 32) 


rlwinm 
rlwimi 
rlwinm 
rlwimi 
rlwinm 


r4,r4,32-sh,sh,31 
r4,r3,32-sh,0,sh-1 
r3,r3,32-sh,sh,31 
r3,r2,32-sh,0,sh-1 
r2,r2,32-sh,sh,31 


Shift Right, N = 2 (shift amnt < 64) 


subfic 
srw 
slw 

or 
addi 
srw 
or 
srw 


131,16,32 
r3,r3,r6 
r0,r2,r31 
r3,r3,r0 
r31,16,-32 
r0,r2,r31 
r3,r3,r0 
r2,r2,r6 


Shift Right, N = 3 (shift amnt < 32) 


subfic 
srw 
slw 

or 
srw 
slw 

or 
srw 
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r4,r4,r6 
r0,r3,r31 
r4,r4,r0 
r3,r3,r6 
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Multiple-precision shifts in 64-bit mode, 
continued 


Multiple-precision shifts in 32-bit mode, 
continued 


Shift Right Algebraic Immediate, N = 3 (shift amnt < 64) Shift Right Algebraic Immediate, N = 3 (shift amnt < 32) 


rildimi r4,r3,0,64-sh rlwinm r4,r4,32-sh,sh,31 
ridicl r4,r4,64-sh,0 rlwimi r4,r3,32-sh,0,sh-1 
rildimi r3,r2,0,64-sh rlwinm 3,r3,32-sh,sh,31 
ridicl r3,1r3,64-sh,0 rlwimi r3,r2,32-sh,0,sh-1 
sradi r2,r2,sh srawi r2,r2,sh 


Shift Right Algebraic, N = 2 (shift amnt < 128) 


Shift Right Algebraic, N = 2 (shift amnt < 64) 


subfic r31,r6,64 subfic r31,r6,32 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
addic. r31,r6,-64 addic. r31,r6,-32 
srad r0,r2,r31 sraw r0,r2,r31 
ble $4+8 ble $4+8 

ori r3,r0,0 ori r3,r0,0 
srad r2,r2,r6 sraw r2,r2,r6 


Shift Right Algebraic, N = 3 (shift amnt < 64) 


Shift Right Algebraic, N = 3 (shift amnt < 32) 


subfic r31,r6,64 subfic r31,r6,32 
srd r4,r4,r6 srw r4,r4,r6 
sid r0,r3,r31 slw r0,r3,r31 
or r4,r4,r0 or r4,r4,r0 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
srad r2,r2,r6 sraw r2,r2,r6 
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C.2 Floating-Point Conversions 


This section gives examples of how the Floating-Point 
Conversion instructions can be used to perform 
various conversions. 


C.2.1 Conversion from 
Floating-Point Number to 
Floating-Point Integer 


The full convert to floating-point integer function can 
be implemented with the sequence shown below, 
assuming the floating-point value to be converted is 
in FPR 1 and the result is returned in FPR 3. 


mtfsbdO 23 #clear VXCVI 
fctid[z] £3,f1 #convert to fx int 
fcfid £3, £3 #convert back again 
merfs 7,5 #VXCVI to CR 

bf 31,548 #skip if VXCVI was 0 
fmr £3,f1 #input was fp int 


C.2.2 Conversion from Floating-Point 
Number to Signed Fixed-Point Integer 
Doubleword 


The full convert to signed fixed-point integer 
doubleword function can be implemented with the 
sequence shown below, assuming the floating-point 
value to be converted is in FPR 1, the result is 
returned in GPR 3, and a doubleword at displacement 
“disp” from the address in GPR 1 can be used as 
scratch space. 


fetid[z] £2,f1 


stfd £2, disp (r1) 
ld r3,disp(r1) 


#convert to dword int 
#store float 
#load dword 


Warning: Some of the examples use the optional fsel 
instruction. Care must be taken in using fsel if IEEE 
compatibility is required, or if the values being tested 
can be NaNs or infinities; see Section C.3.4, “Notes” 
on page 160. 


C.2.3 Conversion from Floating-Point 
Number to Unsigned Fixed-Point 
Integer Doubleword 


The full convert to unsigned fixed-point integer 
doubleword function can be implemented with the 
sequence shown below, assuming the floating-point 
value to be converted is in FPR 1, the value 0 is in 
FPR 0, the value 264— 2048 is in FPR 3, the value 2° is 
in FPR 4 and GPR 4, the result is returned in GPR 3, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fsel f2,f1,f1,f0 #use 0 if < 0 
fsub £5,£3,f1 #use max if > max 
fsel £2,£5,£2,£3 
fsub £5,£2,£4 
fompu cr2,£2,£4 
fsel £2,£5,£5,£2 
fctid[z] £2,f2 

stfd £2, disp (rl) 


#subtract 2**63 
#use diff if > 2**63 


#convert to fx int 
#store float 


ld r3,disp(r1)  #load dword 
blt cr2, $+8 #add 2**63 if input 
add 13,73,r4 # was > 2**63 


C.2.4 Conversion from Floating-Point 
Number to Signed Fixed-Point Integer 
Word 


The full convert to signed fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the floating-point value to be 
converted is in FPR 1, the result is returned in GPR 3, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fctiw[z] £2,f1 #convert to fx int 
stfd f2,disp(r1)  #store float 
lwa r3,disp+4(r1) #load word algebraic 
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C.2.5 Conversion from Floating-Point 
Number to Unsigned Fixed-Point 
Integer Word 


The full convert to unsigned fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the floating-point value to be 
converted is in FPR 1, the value 0 is in FPR 0, the 
value 292-1 is in FPR 3, the result is returned in GPR 
3, and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fsel f2,f1,f1,f0  #use 0 if < 0 

fsub £4,£3,f1 #use max if > max 
fsel £2,£4,£2,f3 

fctid[z] £2,£2 #convert to fx int 
stfd f2,disp(r1)  #store float 

lwz r3,disp+4(r1) #load word and zero 


C.2.6 Conversion from Signed 
Fixed-Point Integer Doubleword to 
Floating-Point Number 


The full convert from signed fixed-point integer 
doubleword function, using the rounding mode speci- 
fied by FPSCRpy, can be implemented with the 
sequence shown below, assuming the fixed-point 
value to be converted is in GPR 3, the result is 
returned in FPR 1, and a doubleword at displacement 
“disp” from the address in GPR 1 can be used as 
scratch space. 


std r3,disp(r1)  #store dword 
lfd fl,disp(r1)  #load float 
fcfid Flt) #convert to fp int 


C.2.7 Conversion from Unsigned 
Fixed-Point Integer Doubleword to 
Floating-Point Number 


The full convert from unsigned fixed-point integer 
doubleword function, using the rounding mode speci- 
fied by FPSCRpay, can be implemented with the 
sequence shown below, assuming the fixed-point 
value to be converted is in GPR 3, the value 2° is in 
FPR 4, the result is returned in FPR 1, and two 
doublewords at displacement “disp” from the address 
in GPR 1 can be used as scratch space. 


rldicl 12,r3,32,32 #isolate high half 
ridicl r0,r3,0,32 #isolate low half 

std r2,disp(r1)  #store dword both 

std r0,disp+8 (r1) 

lfd f2,disp(r1) #load float both 

lfd f1,disp+8 (r1) 

fcfid £2,f2 #convert each half to 
fcfid f1,f1 # fp int (exact result) 


fmadd £1,£4,f£2,f1 #(2**32)*high + low 


An alternative, shorter, sequence can be used if 
rounding according to FSCPRpy is desired and 
FPSCRpen specifies Round toward + Infinity or Round 
toward — Infinity, or if it is acceptable for the rounded 
answer to be either of the two representable floating- 
point integers nearest to the given fixed-point integer. 
In this case the full convert from unsigned fixed-point 
integer doubleword function can be implemented with 
the sequence shown below, assuming the value 2° is 
in FPR 2. 


std r3,disp(r1)  #store dword 
lfd fl,disp(r1)  #load float 
fcfid £1, £1 #convert to fp int 


fadd £4,f1,£2 Hadd 2**64 
fsel f1,f1,f1,f4 # if r3 <0 


C.2.8 Conversion from Signed 
Fixed-Point Integer Word to 
Floating-Point Number 


The full convert from signed fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the fixed-point value to be 
converted is in GPR 3, the result is returned in FPR 1, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. (The 
result is exact.) 


extsw 13,73 #extend sign 


std r3,disp(r1)  #store dword 
lfd fl,disp(r1)  #load float 
fcfid £1, E1 #convert to fp int 


C.2.9 Conversion from Unsigned 
Fixed-Point Integer Word to 
Floating-Point Number 


The full convert from unsigned fixed-point integer 
word function can be implemented with the sequence 
shown below, assuming the fixed-point value to be 
converted is in GPR 3, the result is returned in FPR 1, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. (The 
result is exact.) 


rldicl r0,r3,0,32 #zero-extend 
std r0,disp(r1)  #store dword 
lfd fl,disp(r1) #load float 
fcfid Fleet #convert to fp int 
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C.3 Floating-Point Selection 


This section gives examples of how the optional 
Floating Select instruction can be used to implement 
floating-point minimum and maximum functions, and 
certain simple forms of if-then-else constructions, 
without branching. 


The examples show program fragments in an imagi- 
nary, C-like, high-level programming language, and 
the corresponding program fragment using fsel and 
other PowerPC instructions. In the examples, a, b, x, 


C.3.1 Comparison to Zero 
High-level language: PowerPC: Notes 
if a>0.0 then x ¢y fsel fx,fa,fy,fz (1) 
else x €2Z 
if a> 0.0 thenx ¢y fneg fs,fa (1,2) 
else x ¢€2z fsel fx,fs,fz,fy 
if a-=0.0 thenx ¢y fsel fx,fa,fy,fz (1) 
else x ¢€2z fneg fs,fa 
fsel fx,fs,fx,fz 
C.3.2 Minimum and Maximum 
High-level language: PowerPC: Notes 
x € min(a,b) fsub fs,fa,fb (3,4,5) 
fsel fx,fs,fb,fa 
x € max(a,b) fsub fs,fa,fb (3,4,5) 
fsel fx,fs,fa,fb 
C.3.3 Simple if-then-else 
Constructions 
High-level language: PowerPC: Notes 
if a>bthenx ¢y fsub fs,fa,fb (4,5) 
else x ¢€2 fsel fx,fs,fy,fz 
if a>bthenx ¢y fsub fs,fb,fa (3,4,5) 
else x ¢€2z fsel fx,fs,fz,fy 
if a=b then x ¢y fsub fs,fa,fb (4,5) 
else x ¢€2z fsel fx,fs,fy,fz 
fneg fs,fs 
fsel fx,fs,fx,fz 





y, and z are floating-point variables, which are 
assumed to be in FPRs fa, fb, fx, fy, and fz. FPR fs is 
assumed to be available for scratch space. 


Additional examples can be found in Section C.2, 
“Floating-Point Conversions” on page 158. 


Warning: Care must be taken in using fsel if IEEE com- 
patibility is required, or if the values being tested can 
be NaNs or infinities; see Section C.3.4. 


C.3.4 Notes 


The following Notes apply to the preceding examples 
and to the corresponding cases using the other three 
arithmetic relations (<, <, and #). They should also 
be considered when any other use of fsel is contem- 
plated. 


In these Notes, the “optimized program” is the 
PowerPC program shown, and the “unoptimized 
program” (not shown) is the corresponding PowerPC 
program that uses fempu and Branch Conditional 
instructions instead of fsel. 


1. The unoptimized program affects the VXSNAN bit 
of the FPSCR, and therefore may cause the 
system error handler to be invoked if the corre- 
sponding exception is enabled, while the opti- 
mized program does not affect this bit. This 
property of the optimized program is incompat- 
ible with the IEEE standard. 


2. The optimized program gives the incorrect result 
if ais a NaN. 


3. The optimized program gives the incorrect result 
if a and/or b is a NaN (except that it may give the 
correct result in some cases for the minimum and 
maximum functions, depending on how those 
functions are defined to operate on NaNs). 


4. The optimized program gives the incorrect result 
if a and b are infinities of the same sign. (Here it 
is assumed that Invalid Operation Exceptions are 
disabled, in which case the result of the sub- 
traction is a NaN. The analysis is more compli- 
cated if Invalid Operation Exceptions are enabled, 
because in that case the target register of the 
subtraction is unchanged.) 


5. The optimized program affects the OX, UX, XX, 
and VXISI bits of the FPSCR, and therefore may 
cause the system error handler to be invoked if 
the corresponding exceptions are enabled, while 
the unoptimized program does not affect these 
bits. This property of the optimized program is 
incompatible with the IEEE standard. 
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Appendix D. Cross-Reference for Changed POWER 


Mnemonics 


The following table lists the POWER instruction mne- 
monics that have been changed in the PowerPC Archi- 
tecture, sorted by POWER mnemonic. 


To determine the PowerPC mnemonic for one of these 
POWER mnemonics, find the POWER mnemonic in the 
second column of the table: the remainder of the line 
gives the PowerPC mnemonic and the page or Book in 
which the instruction is described, as well as the 
instruction names. A page number is shown for 
instructions that are defined in this Book (Book l, 
PowerPC User Instruction Set Architecture), and the 


Book number is shown for instructions that are 
defined in other Books (Book II, PowerPC Virtual Envi- 
ronment Architecture, and Book Ill, PowerPC Oper- 
ating Environment Architecture). \f an instruction is 
defined in more than one of these Books, the lowest- 
numbered Book is used. 


POWER mnemonics that have not changed are not 
listed. POWER instruction names that are the same in 
PowerPC are not repeated; i.e., for these, the last 
column of the table is blank. 








Bk | Mnemonic Instruction Mnemonic Instruction 
51 |/afol][.] Add addc[o][.] Add Carrying 
52 |ae[o][.] Add Extended adde[o][.] 
50 Jai Add Immediate addic Add Immediate Carrying 
50 |ai. Add Immediate and Record addic. Add Immediate Carrying and Record 
52 |ame[o][.] Add To Minus One Extended addme[o][.] |Add to Minus One Extended 
62 |andil. AND Immediate Lower andi. AND Immediate 
62 jandiu. AND Immediate Upper andis. AND Immediate Shifted 
53 |aze[o][.] Add To Zero Extended addze[o][.] |Add to Zero Extended 
24 |bcc[l] Branch Conditional to Count Register |bcctr[I] 
24 |ber[l] Branch Conditional to Link Register |bclr[l] 
49 |cal Compute Address Lower addi Add Immediate 
49 |cau Compute Address Upper addis Add Immediate Shifted 
50 |cax[o][.] Compute Address add[o][.] Add 
67 |cntlz[.] Count Leading Zeros cntlzw[.] Count Leading Zeros Word 
Il} delz Data Cache Line Set to Zero dcbz Data Cache Block set to Zero 
Il|dcs Data Cache Synchronize sync Synchronize 
66 |exts[.] Extend Sign extsh[.] Extend Sign Halfword 
105 |fal[.] Floating Add fadd[.] 
106 |fd[.] Floating Divide fdiv[.] 
106 |fm[.] Floating Multiply fmul[.] 
107 |fmal[.] Floating Multiply-Add fmadd[.] 
107 |fms[.] Floating Multiply-Subtract fmsub[.] 
108 |fnma[.] Floating Negative Multiply-Add fnmadd[.] 
108 |fnms[.] Floating Negative Multiply-Subtract |fnmsub[.] 
105 /fs[.] Floating Subtract fsub[.] 
Iljics Instruction Cache Synchronize isync Instruction Synchronize 
35 || Load lwz Load Word and Zero 
42 ||brx Load Byte-Reverse Indexed Iwbrx Load Word Byte-Reverse Indexed 
44 |lm Load Multiple Imw Load Multiple Word 
46 |Isi Load String Immediate Iswi Load String Word Immediate 
46 |Isx Load String Indexed Iswx Load String Word Indexed 
35 |lu Load with Update Iwzu Load Word and Zero with Update 
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Page / 
Bk 


35 


35 
II 
54 
54 
63 
63 
73 


70 
72 


51 
52 
51 
52 
53 
74 
75 
77 
76 


40 
43 
44 
47 
47 
40 
40 
40 
25 
61 
60 
I 
63 
63 








POWER PowerPC 
Mnemonic Instruction Mnemonic Instruction 
lux Load with Update Indexed lwzux Load Word and Zero with Update 
Indexed 
Ix Load Indexed lwzx Load Word and Zero Indexed 
mtsri Move To Segment Register Indirect |mtsrin 
muli Multiply Immediate mulli Multiply Low Immediate 
muls[o][.] Multiply Short mullw[o][.] |Multiply Low Word 
oril OR Immediate Lower ori OR Immediate 
oriu OR Immediate Upper oris OR Immediate Shifted 
rlimi[.] Rotate Left Immediate Then Mask rlwimil[.] Rotate Left Word Immediate then 
Insert Mask Insert 
rlinm[.] Rotate Left Immediate Then AND rlwinm[.] Rotate Left Word Immediate then 
With Mask AND with Mask 
rinm[.] Rotate Left Then AND With Mask rlwnm[.] Rotate Left Word then AND with 
Mask 
sf[o][.] Subtract From subfc[ o][.] Subtract From Carrying 
sfe[o][.] Subtract From Extended subfe[o][.] 
sfi Subtract From Immediate subfic Subtract From Immediate Carrying 
sfme[o][.] Subtract From Minus One Extended |subfme[o][.] 
sfze[o][.] Subtract From Zero Extended subfze[o][.] 
sI[.] Shift Left slw[.] Shift Left Word 
sr[.] Shift Right srw[.] Shift Right Word 
sra[.] Shift Right Algebraic sraw[.] Shift Right Algebraic Word 
srail[.] Shift Right Algebraic Immediate srawi[.] Shift Right Algebraic Word Imme- 
diate 
st Store stw Store Word 
stbrx Store Byte-Reverse Indexed stwbrx Store Word Byte-Reverse Indexed 
stm Store Multiple stmw Store Multiple Word 
stsi Store String Immediate stswi Store String Word Immediate 
stsx Store String Indexed stswx Store String Word Indexed 
stu Store with Update stwu Store Word with Update 
stux Store with Update Indexed stwux Store Word with Update Indexed 
stx Store Indexed stwx Store Word Indexed 
svca Supervisor Call sc System Call 
t Trap tw Trap Word 
ti Trap Immediate twi Trap Word Immediate 
tlbi TLB Invalidate Entry tlbie 
xoril XOR Immediate Lower xori XOR Immediate 
xoriu XOR Immediate Upper xoris XOR Immediate Shifted 
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Appendix E. Incompatibilities with the POWER Architecture 


This appendix identifies the known incompatibilities 
that must be managed in the migration from the 
POWER Architecture to the PowerPC Architecture. 
Some of the incompatibilities can, at least in principle, 
be detected by the processor, which could trap and 
let software simulate the POWER operation. Others 
cannot be detected by the processor even in prin- 
ciple. 


E.1 New Instructions, Formerly 
Privileged Instructions 


Instructions new to PowerPC typically use opcode 
values (including extended opcode) that are illegal in 
POWER. A few instructions that are privileged in 
POWER (e.g., delz, called debz in PowerPC) have been 
made nonprivileged in PowerPC. Any POWER 
program that executes one of these now-valid or now- 
nonprivileged instructions, expecting to cause the 
system illegal instruction error handler or the system 
privileged instruction error handler to be invoked, will 
not execute correctly on PowerPC. 


E.2 Newly Privileged Instructions 


The following instructions are nonprivileged in POWER 
but privileged in PowerPC. 


mfmsr 
mfsr 


E.3 Reserved Fields in 
Instructions 


These fields are shown with “/”s in the instruction 
layouts. In both POWER and PowerPC these fields are 
ignored by the processor. The PowerPC Architecture 
states that these fields should be coded as zero. The 
POWER Architecture lacks such a statement, but it is 
expected that essentially all POWER programs contain 
zero in these fields. 


In general, the incompatibilities identified here are 
those that affect a POWER application program; 
incompatibilities for instructions that can be used only 
by POWER system programs are not necessarily dis- 
cussed. 


In several cases the PowerPC Architecture assumes 
that reserved fields in POWER instructions indeed 
contain zero. The cases include the following. 


a belr{!) and bectr[!| assume that bits 19:20 in the 
POWER instructions are 0. 

= cmpi, cmp, cmpli, and cmpl assume that bit 10 in 
the POWER instructions is 0. 

= mtspr and mfspr assume that bits 16:20 in the 
POWER instructions are 0. 

= mitcrf and mfcr assume that bit 11 in the POWER 
instructions is 0. 

= Synchronize assumes that bits 9:10 in the POWER 
instruction (des) are 0. (This assumption provides 
compatibility for application programs, but not 
necessarily for operating system programs; see 
Section E.22.) 


E.4 Reserved Bits in Registers 


Both POWER and PowerPC permit software to write any 
value to these bits. However in POWER reading sucha 
bit always returns 0, while in PowerPC reading it may 
return either O or the value that was last written to it. 


E.5 Alignment Check 


The POWER MSR AL bit (bit 24) is no longer supported; 
the corresponding PowerPC MSR bit, bit 56, is 
reserved. The low-order bits of the EA are always 
used. (Notice that the value 0 —the normal value for a 
reserved bit —means “ignore the low-order EA bits” in 
POWER, and the value 1 means “use the low-order EA 
bits”.) POWER-compatible operating system code will 
probably write the value 1 to this bit. 
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E.6 Condition Register 


The following instructions specify a field in the CR 
explicitly (via the BF field) and also, in POWER, use bit 
31 as the Record bit. In PowerPC, bit 31 is a reserved 
field for these instructions and is ignored by the 
processor. In POWER, if bit 31 contains 1 the 
instructions execute normally (i.e., as if the bit con- 
tained 0) except as follows: 


cmp CRO is undefined if Rc=1 and BF+0 
cmpl CRO is undefined if Rc=1 and BF+0 
merxr CRO is undefined if Rc=1 and BF#0 
fompu CR1 is undefined if Rc=1 
fcompo CR1 is undefined if Rc=1 
merfs CR1 is undefined if Rc=1 and BF# 1 


E.7 LK and Rc Bits 


For the instructions listed below, if bit 31 (LK or Rc bit 
in POWER) contains 1, in POWER the instruction exe- 
cutes as if the bit contained 0 with the following 
exceptions: if LK=1, the Link Register is set; if Rc=1, 
Condition Register Field 0 or 1 is set to an undefined 
value. In PowerPC, bit 31 is a reserved field for these 
instructions and is ignored by the processor. 


PowerPC instructions for which bit 31 is the LK bit in 
POWER: 


sc (svc in POWER) 

the Condition Register Logical instructions 
merf 

isync (ics in POWER) 


PowerPC instructions for which bit 31 is the Re bit in 
POWER: 


fixed-point X-form Load and Store instructions 
fixed-point X-form Compare instructions 

the X-form Trap instruction 

mtspr, mfspr, mtcrf, merxr, mfcr, mtocrf, mfocrf 
floating-point X-form Load and Store instructions 
floating-point Compare instructions 

merfs 

debz (delz in POWER) 


E.8 BO Field 


POWER shows certain bits in the BO field — used by 
Branch Conditional instructions —as “x”. Although the 
POWER Architecture does not say how these bits are to 
be interpreted, they are in fact ignored by the 
processor. 


PowerPC shows these bits as “z”, “a”, or “t”. The “z” 
bits are ignored, as in POWER. However, the “a” and 


“t” bits can be used by software to provide a hint about 
how the branch is likely to behave. If a POWER 
program has the “wrong” value for these bits, the 
program will produce the same results as on POWER 
but performance may be affected. 


E.9 BH Field 


Bits 19:20 of the Branch Conditional to Link Register 
and Branch Conditional to Count Register instructions 
are reserved in POWER but are defined as a branch 
hint (BH) field in PowerPC. Because these bits are 
hints, they may affect performance but do not affect 
the results of executing the instruction. 


E.10 Branch Conditional to Count 
Register 


For the case in which the Count Register is decre- 
mented and tested (i.e., the case in which BOp=0), 
POWER specifies only that the branch target address is 
undefined, with the implication that the Count Register, 
and the Link Register if LK=1, are updated in the 
normal way. PowerPC specifies that this instruction 
form is invalid. 


E.11 System Call 


There are several respects in which PowerPC is incom- 
patible with POWER for System Call instructions — 
which in POWER are called Supervisor Call 
instructions. 


m= POWER provides a version of the Supervisor Call 
instruction (bit 30 = 0) that allows instruction 
fetching to continue at any one of 128 locations. It 
is used for “fast SVCs”. PowerPC provides no 
such version: if bit 30 of the instruction is 0 the 
instruction form is invalid. 


m POWER provides a version of the Supervisor Call 
instruction (bits 30:31 = 0Ob11) that resumes 
instruction fetching at one location and sets the 
Link Register to the address of the next instruc- 
tion. PowerPC provides no such version: bit 31 is 
a reserved field. 


= For POWER, information from the MSR is saved in 
the Count Register. For PowerPC this information 
is saved in SRR1. 


m= In POWER bits 16:19 and 27:29 of the instruction 
comprise defined instruction fields or a portion 
thereof, while in PowerPC these bits comprise 
reserved fields. 
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= In POWER bits 20:26 of the instruction comprise a 
portion of the SV field, while in PowerPC these bits 
comprise the LEV field. 


m POWER saves the low-order 16 bits of the instruc- 
tion, in the Count Register. PowerPC does not 
save them. 


m The settings of MSR bits by the associated inter- 
rupt differ between POWER and PowerPC; see 
POWER Processor Architecture and Book Ill, 
PowerPC Operating Environment Architecture. 


E.12 Fixed-Point Exception 
Register (XER) 


Bits 48:55 of the XER are reserved in PowerPC, while in 
POWER the corresponding bits (16:23) are defined and 
contain the comparison byte for the Iscbx instruction 
(which PowerPC lacks). 


E.13 Update Forms of Storage 
Access Instructions 


PowerPC requires that RA not be equal to either RT 
(fixed-point Load only) or 0. If the restriction is vio- 
lated the instruction form is invalid. POWER permits 
these cases, and simply avoids saving the EA. 


E.14 Multiple Register Loads 


PowerPC requires that RA, and RB if present in the 
instruction format, not be in the range of registers to 
be loaded, while POWER permits this and does not 
alter RA or RB in this case. (The PowerPC restriction 
applies even if RA=0, although there is no obvious 
benefit to the restriction in this case since RA is not 
used to compute the effective address if RA=0.) If the 
PowerPC restriction is violated, either the system 
illegal instruction error handler is invoked or the 
results are boundedly undefined. The instructions 
affected are: 


Imw (Im in POWER) 
Iswi (Isi in POWER) 
Iswx (Isx in POWER) 


For example, an /mw instruction that loads all 32 regis- 
ters is valid in POWER but is an invalid form in 
PowerPC. 


E.15 Load/Store Multiple 
Instructions 


There are two respects in which PowerPC is incompat- 
ible with POWER for Load Multiple and Store Multiple 
instructions. 


a Ifthe EA is not word-aligned, in PowerPC either an 
Alignment exception occurs or the addressed 
bytes are loaded, while in POWER an Alignment 
interrupt occurs if MSRa,;=1 (the low-order two 
bits of the EA are ignored if MSR,,=0). 


m= In PowerPC the instruction may be interrupted by a 
system-caused interrupt, while in POWER the 
instruction cannot be thus interrupted. 


E.16 Move Assist Instructions 


There are several respects in which PowerPC is incom- 
patible with POWER for Move Assist instructions. 


m= In PowerPC an /swx instruction with zero length 
leaves the contents of RT undefined (if RT# RA and 
RT#RB) or is an invalid instruction form (if 
RT=RA or RT=RB), while in POWER the corre- 
sponding instruction (Isx) is a no-op in these cases. 


m= In PowerPC an /swx instruction with zero length 
may alter the Reference bit, and a stswx instruc- 
tion with zero length may alter the Reference and 
Change bits, while in POWER the corresponding 
instructions (Isx and stsx) do not alter the Refer- 
ence and Change bits in this case. 


m= In PowerPC a Move Assist instruction may be inter- 
rupted by a system-caused interrupt, while in 
POWER the instruction cannot be thus interrupted. 


E.17 Move To/From SPR 


There are several respects in which PowerPC is incom- 
patible with POWER for Move To/From Special Purpose 
Register instructions. 


= The SPR field is ten bits long in PowerPC, but only 
five in POWER (see also Section E.3, “Reserved 
Fields in Instructions” on page 163). 


= mfspr can be used to read the Decrementer in 
problem state in POWER, but only in privileged 
state in PowerPC. 


m If the SPR value specified in the instruction is not 
one of the defined values, POWER behaves as 
follows. 


— Ifthe instruction is executed in problem state 
and SPRo=1, a Privileged Instruction type 
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Program interrupt occurs. No architected reg- 
isters are altered except those set by the 
interrupt. 


— Otherwise no architected 


altered. 


registers are 


In this same case, PowerPC behaves as follows. 


— Ifthe instruction is executed in problem state 
and spro=1, either an Illegal Instruction type 
Program interrupt or a Privileged Instruction 
type Program interrupt occurs. No architected 
registers are altered except those set by the 
interrupt. 


— Otherwise either an Illegal Instruction type 
Program interrupt occurs (in which case no 
architected registers are altered except those 
set by the interrupt) or the results are 
boundedly undefined (or possibly undefined, 
for mtspr; see Book Ill). 


E.18 Effects of Exceptions on 
FPSCR Bits FR and Fl 


For the following cases, POWER does not specify how 
FR and FI are set, while PowerPC preserves them for 
Invalid Operation Exception caused by a Compare 
instruction, sets Fl to 1 and FR to an undefined value 
for disabled Overflow Exception, and clears them oth- 
erwise. 


a Invalid Operation Exception (enabled or disabled) 
m= Zero Divide Exception (enabled or disabled) 
= Disabled Overflow Exception 


E.19 Store Floating-Point Single 
Instructions 


There are several respects in which PowerPC is incom- 
patible with POWER for Store Floating-Point Single 
instructions. 


m POWER uses FPSCRyg to help determine whether 
denormalization should be done, while PowerPC 
does not. Using FPSCRye¢ is in fact incorrect: if 
FPSCRye= 1 and a denormalized single-precision 
number is copied from one storage location to 
another by means of /fs followed by stfs, the two 
“copies” may not be the same. 


m= For an operand having an exponent that is less 
than 874 (unbiased exponent less than — 149), 
POWER stores a zero (if FPSCRyg=0) while 
PowerPC stores an undefined value. 


E.20 Move From FPSCR 


POWER defines the high-order 32 bits of the result of 
mffs to be OxFFFF_FFFF, while PowerPC specifies that 
they are undefined. 


E.21 Zeroing Bytes in the Data 
Cache 


The delz instruction of POWER and the debz instruction 
of PowerPC have the same opcode. However, the func- 
tions differ in the following respects. 


m dclz clears a line while debz clears a block. 

m dclz saves the EA in RA (if RA#0) while debz does 
not. 

m dcilzis privileged while dcebz is not. 


E.22 Synchronization 


The Synchronize instruction (called des in POWER) and 
the isync instruction (called ics in POWER) cause more 
pervasive synchronization in PowerPC than in POWER. 
However, unlike des, Synchronize does not wait until 
data cache block writes caused by _ preceding 
instructions have been performed in main storage. 
Also, Synchronize has an L field while des does not, 
and some uses of the instruction by the operating 
system require L=2. (The L field corresponds to 
reserved bits in des and hence will be 0 in POWER pro- 
grams.) 


E.23 Move To Machine State 
Register Instruction 


The mtmsr instruction has an L field in PowerPC but 
not in POWER. The function of the variant of mtmsr 
with L=1 differs from the function of the instruction in 
the POWER architecture in the following ways. 


= In PowerPC, this variant of mtmsr modifies only 
the EE and RI bits of the MSR, while in the POWER 
mtmsr modifies all bits of the MSR. 

a This variant of mtmsr is execution synchronizing in 
PowerPC but is context synchronizing in POWER. 
(The POWER architecture lacks PowerPC's dis- 
tinction between execution synchronization and 
context synchronization. The statement in the 
POWER architecture specification that mtmsr is 
"synchronizing" is equivalent to stating that the 
instruction is context synchronizing.) 


Also, mtmsr is optional in PowerPC but required in 
POWER. 
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E.24 Direct-Store Segments 


POWER's direct-store segments are not supported in 
PowerPC. 


E.25 Segment Register 
Manipulation Instructions 


The definitions of the four Segment Register Manipu- 
lation instructions mtsr, mtsrin, mfsr, and mfsrin differ 
in two respects between POWER and PowerPC. 
Instructions similar to mtsrin and mfsrin are called 
mtsri and mfsriin POWER. 


privilege: mfsr and mfsri are problem state 
instructions in POWER, while mfsr and 
mfsrin are privileged in PowerPC. 


function: the “indirect” instructions (mtsri and 
mfsri) in POWER use an RA register in 
computing the Segment Register number, 
and the computed EA is stored into RA (if 
RA#0 and RA#RT), while in PowerPC 
mtsrin and mfsrin have no RA field and the 
EA is not stored. 


mtsr, mtsrin (mtsri), and mfsr have the same opcodes 
in PowerPC as in POWER. mfsri (POWER) and mfsrin 
(PowerPC) have different opcodes. 


Also, the Segment Register Manipulation instructions 
are required in POWER whereas they are optional in 
PowerPC. 


E.26 TLB Entry Invalidation 


The tlbi instruction of POWER and the tlbie instruction 
of PowerPC have the same opcode. However, the func- 
tions differ in the following respects. 


a tlbi computes the EA as (RA|O) + (RB), while tlbie 
lacks an RA field and computes the EA and related 
information as (RB). 

u tlbisaves the EA in RA (if RA#0), while tlbie lacks 
an RA field and does not save the EA. 

= For tlbi the high-order 36 bits of RB are used in 
computing the EA, while for tlbie these bits contain 
additional information that is not directly related to 
the EA. 

a tlbie has an L field, while tlbi does not. 


Also, tlbi is required in POWER whereas tlbie is 
optional in PowerPC. 


E.27 Alignment Interrupts 


Placing information about the interrupting instruction 
into the DSISR and the DAR when an Alignment inter- 
rupt occurs is optional in PowerPC but required in 
POWER. 


E.28 Floating-Point Interrupts 


POWER uses MSR bit 20 to control the generation of 
interrupts for floating-point enabled exceptions, and 
PowerPC uses the corresponding MSR bit, bit 52, for 
the same purpose. However, in PowerPC this bit is 
part of a two-bit value that controls the occurrence, 
precision, and recoverability of the interrupt, while in 
POWER this bit is used independently to control the 
occurrence of the interrupt (in POWER all floating-point 
interrupts are precise). 


E.29 Timing Facilities 


E.29.1 Real-Time Clock 


The POWER Real-Time Clock is not supported in 
PowerPC. Instead, PowerPC provides a Time Base. 
Both the RTC and the TB are 64-bit Special Purpose 
Registers, but they differ in the following respects. 


m= The RTC counts seconds and nanoseconds, while 
the TB counts “ticks”. The ticking rate of the TB is 
implementation-dependent. 

= The RTC increments discontinuously: 1 is added to 
RTCU when the value in RTCL_ passes 
999 999 999. The TB increments continuously: 1 
is added to TBU when the value in TBL passes 
OxFFFF_FFFF. 

= The RTC is written and read by the mtspr and 
mfspr instructions, using SPR numbers that denote 
the RTCU and RTCL. The TB is written by the 
mtspr instruction (using new SPR numbers), and 
read by the new mftb instruction. 

= The SPR numbers that denote POWER's RTCL and 
RTCU are invalid in PowerPC. 

m= The RTC is guaranteed to increment at least once 
in the time required to execute ten Add Immediate 
instructions. No analogous guarantee is made for 
the TB. 

= Not all bits of RTCL need be implemented, while all 
bits of the TB must be implemented. 
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E.29.2 Decrementer 


The PowerPC Decrementer differs from the POWER 
Decrementer in the following respects. 


m The PowerPC DEC decrements at the same rate 
that the TB increments, while the POWER DEC dec- 
rements every nanosecond (which is the same 
rate that the RTC increments). 

= Not all bits of the POWER DEC need be imple- 
mented, while all bits of the PowerPC DEC must be 
implemented. 

m The interrupt caused by the DEC has its own inter- 
rupt vector location in PowerPC, but is considered 
an External interrupt in POWER. 


E.30 Deleted Instructions 


The following instructions are part of the POWER Archi- 
tecture but have been dropped from the PowerPC 
Architecture. 


abs 
cles 
clf 

cli (*) 
dclst 
div 
divs 
doz 
dozi 
Iscbx 
maskg 
maskir 
mftsri 
mul 
nabs 
rac (*) 
rfi (*) 
rfsve (*) 
rlmi 
rrib 
sle 
sleq 
sliq 
slliq 
sllq 
slq 
sraiq 
sraq 
sre 
srea 
sreq 
sriq 
srliq 
srlq 
srq 


Absolute 

Cache Line Compute Size 

Cache Line Flush 

Cache Line Invalidate 

Data Cache Line Store 

Divide 

Divide Short 

Difference Or Zero 

Difference Or Zero Immediate 

Load String And Compare Byte Indexed 
Mask Generate 

Mask Insert From Register 

Move From Segment Register Indirect 
Multiply 

Negative Absolute 

Real Address Compute 

Return From Interrupt 

Return From SVC 

Rotate Left Then Mask Insert 

Rotate Right And Insert Bit 

Shift Left Extended 

Shift Left Extended With MQ 

Shift Left Immediate With MQ 

Shift Left Long Immediate With MQ 
Shift Left Long With MQ 

Shift Left With MQ 

Shift Right Algebraic Immediate With MQ 
Shift Right Algebraic With MQ 

Shift Right Extended 

Shift Right Extended Algebraic 

Shift Right Extended With MQ 

Shift Right Immediate With MQ 

Shift Right Long Immediate With MQ 
Shift Right Long With MQ 

Shift Right With MQ 


(*) This instruction is privileged. 


Note: Many of these instructions use the MQ register. 
The MQ is not defined in the PowerPC Architecture. 
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. . rfi (* 19 50 
E.31 Discontinued Opcodes wie oe 
rlmi 22 - 
The opcodes listed below are defined in the POWER Ke st ae 
Architecture but have been dropped from the PowerPC ae 
sleq 31 217 


Architecture. The list contains the POWER mnemonic 





(MNEM), the primary opcode (PRI), and the extended ain es 
opcode (XOP) if appropriate. The corresponding aie 31 216 
instructions are reserved in PowerPC. oud 
slq 31 152 
MNEM PRI XOP sraiq 31 952 
abs 31 360 sraq 31 920 
cles 31 531 sre 31 665 
clf 31 118 srea 31 921 
cli (*) 31 502 sreq 31 729 
dclst 31 630 sriq 31 696 
div 31 331 srliq 31 760 
divs 31 363 srlq 31 728 
doz 31 264 srq 31 664 
dozi 09 oe (*) This instruction is privileged. 
Iscbx 31 277 
maskg 31 29 Assembler Note 
maskir 31 541 
mfsri 31 627 It might be helpful to current software writers for 
mul 31 107 the Assembler to flag the discontinued POWER 
nabs 31 488 instructions. 
rac (*) 31 818 
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E.32 POWER2 Compatibility 


The POWER2 instruction set is a superset of the 
POWER instruction set. Some of the instructions added 
for POWER2 are included in the PowerPC Architecture. 
Those that have been renamed in the PowerPC Archi- 
tecture are listed in this section, as are the new 


POWER2 instructions that are not included in the 
PowerPC Architecture. 


Other incompatibilities are also listed. 


E.32.1 Cross-Reference for Changed POWER2 Mnemonics 


The following table lists the new POWER2 instruction 
mnemonics that have been changed in the PowerPC 
User Instruction Set Architecture, sorted by POWER2 
mnemonic. 


To determine the PowerPC mnemonic for one of these 
POWER2 mnemonics, find the POWER2 mnemonic in 


POWER2 


Mnemonic Instruction 


the second column of the table: the remainder of the 
line gives the PowerPC mnemonic and the page on 
which the instruction is described, as well as the 
instruction names. 


POWER2 mnemonics that have not changed are not 
listed. 


PowerPC 


Mnemonic Instruction 





with Round 
fcirz[.] 
with Round to Zero 





E.32.2 Floating-Point Conversion to 
Integer 


The feir and fcirz instructions of POWER2 have the 
same opcodes as do the fetiw and fetiwz instructions, 
respectively, of PowerPC. However, the functions 
differ in the following respects. 


a feir and feirzset the high-order 32 bits of the target 
FPR to OxFFFF_FFFF, while fctiw and fctiwz set 
them to an undefined value. 

= Except for enabled Invalid Operation Exceptions, 
feir and fcirz set the FPRF field of the FPSCR based 
on the result, while fetiw and fetiwz set it to an 
undefined value. 

= fcir and feirz do not affect the VXSNAN bit of the 
FPSCR, while fetiw and fetiwz do. 

a feir and feirz set FPSCRy x to 1 for certain cases of 
“Large Operands” (i.e., operands that are too 
large to be represented as a 32-bit signed fixed- 
point integer), while fetiw and fetiwz do not alter it 
for any case of “Large Operand”. (The IEEE 
standard requires not altering it for “Large Oper- 
ands”.) 


fcir[.] Floating Convert Double to Integer 


Floating Convert Double to Integer 





fctiw[.] Floating Convert To Integer Word 


fctiwz[.] Floating Convert To Integer Word 


with round toward Zero 





E.32.3 Floating-Point Interrupts 


POWER2 uses MSR bits 20 and 23 to control the gener- 
ation of interrupts for floating-point enabled 
exceptions, and PowerPC uses the corresponding MSR 
bits, bits 52 and 55, for the same purpose. However, in 
PowerPC these bits comprise a two-bit value that con- 
trols the occurrence, precision, and recoverability of 
the interrupt, while in POWER2 these bits are used 
independently to control the occurrence (bit 20) and 
the precision (bit 23) of the interrupt. Moreover, in 
PowerPC all floating-point interrupts are considered 
Program interrupts, while in POWER2 imprecise float- 
ing-point interrupts have their own interrupt vector 
location. 


E.32.4 Trace 


The Trace interrupt vector location differs between the 
two architectures, and there are many other differ- 
ences. 
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E.32.5 Deleted Instructions 


The following instructions are new in POWER2 imple- 
mentations of the POWER Architecture but have been 
dropped from the PowerPC Architecture. 


Ifq Load Floating-Point Quad 

Ifqu Load Floating-Point Quad with Update 

Ifqux Load Floating-Point Quad with Update 
Indexed 

Ifqx Load Floating-Point Quad Indexed 

stfq Store Floating-Point Quad 

stfqu Store Floating-Point Quad with Update 

stfqux Store Floating-Point Quad with Update 
Indexed 

stfqx Store Floating-Point Quad Indexed 


E.32.6 Discontinued Opcodes 


The opcodes listed below are new in POWER2 imple- 
mentations of the POWER Architecture but have been 
dropped from the PowerPC Architecture. The list con- 
tains the POWER2 mnemonic (MNEM), the primary 
opcode (PRI), and the extended opcode (XOP) if appro- 
priate. The corresponding instructions are reserved in 
PowerPC. 


MNEM PRI XOP 
Ifq 56 = 
Ifqu 57 - 
Ifqux 31 823 
Ifqx 31 791 
stfq 60 - 
stfqu 61 - 
stfqux 31 951 
stfqx 31 919 
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Appendix F. New Instructions 


The following instructions in the PowerPC User Instruc- 
tion Set Architecture are new; they are not in the 


POWER Architecture. 


The fres, frsqrte, fsel, and 


fsqrt[ s] instructions are optional. 


cntlizd 
divd 
divdu 
divw 
divwu 
extsb 
extsw 
fadds 
fcfid 
fetid 
fetidz 


fctiw 
fetiwz 


fdivs 
fmadds 
fmsubs 
fmuls 
fnmadds 
fnmsubs 
fres 
frsqrte 
fsel 
fsqri[ s] 
fsubs 

Id 

Idu 

Idux 

Idx 


Count Leading Zeros Doubleword 

Divide Doubleword 

Divide Doubleword Unsigned 

Divide Word 

Divide Word Unsigned 

Extend Sign Byte 

Extend Sign Word 

Floating Add Single 

Floating Convert From Integer Doubleword 
Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword 
with round toward Zero 

Floating Convert To Integer Word 

Floating Convert To Integer Word with 
round toward Zero 

Floating Divide Single 

Floating Multiply-Add Single 

Floating Multiply-Subtract Single 

Floating Multiply Single 

Floating Negative Multiply-Add Single 
Floating Negative Multiply-Subtract Single 
Floating Reciprocal Estimate Single 
Floating Reciprocal Square Root Estimate 
Floating Select 

Floating Square Root [Single] 

Floating Subtract Single 

Load Doubleword 

Load Doubleword with Update 

Load Doubleword with Update Indexed 
Load Doubleword Indexed 


lwa 
lwaux 
lwax 
mfocrf 
mtocrf 
mulhd 
mulhdu 
mulhw 
mulhwu 
mulld 
rldel 
rlder 
ridic 


ridicl 


rldicr 


rldimi 


sid 
srad 
sradi 


srd 
std 
stdu 
stdux 
stdx 
stfiwx 


subf 
td 
tdi 


Load Word Algebraic 

Load Word Algebraic with Update Indexed 
Load Word Algebraic Indexed 

Move From One Condition Register Field 
Move To One Condition Register Field 
Multiply High Doubleword 

Multiply High Doubleword Unsigned 
Multiply High Word 

Multiply High Word Unsigned 

Multiply Low Doubleword 

Rotate Left Doubleword then Clear Left 
Rotate Left Doubleword then Clear Right 
Rotate Left Doubleword Immediate then 
Clear 

Rotate Left Doubleword Immediate then 
Clear Left 

Rotate Left Doubleword Immediate then 
Clear Right 

Rotate Left Doubleword Immediate then 
Mask Insert 

Shift Left Doubleword 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Doubleword Imme- 
diate 

Shift Right Doubleword 

Store Doubleword 

Store Doubleword with Update 

Store Doubleword with Update Indexed 
Store Doubleword Indexed 
Store Floating-Point as 
Indexed 

Subtract From 

Trap Doubleword 

Trap Doubleword Immediate 


Integer Word 
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Appendix G. Illegal Instructions 


With the exception of the instruction consisting entirely 
of binary Os, the instructions in this class are available 
for future extensions of the PowerPC Architecture; that 
is, some future version of the PowerPC Architecture 
may define any of these instructions to perform new 
functions. 


The following primary opcodes are illegal. 
1,4,5,6 


The following primary opcodes have unused extended 
opcodes. Their unused extended opcodes can be 
determined from the opcode maps in Appendix |. All 
unused extended opcodes are illegal. 


19, 30, 31, 56, 57, 58, 59, 60, 61, 62, 63 
An instruction consisting entirely of binary Os is illegal, 


and is guaranteed to be illegal in all future versions of 
this architecture. 
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Appendix H. Reserved Instructions 


The instructions in this class are allocated to specific 
purposes that are outside the scope of the PowerPC 
User Instruction Set Architecture, PowerPC Virtual 
Environment Architecture, and PowerPC Operating 
Environment Architecture. 


The following types of instruction are included in this 
class. 


Te 


The instruction having primary opcode 0, except 
the instruction consisting entirely of binary Os 
(which is an illegal instruction; see Section 1.8.2, 
“Illegal Instruction Class” on page 12) and the 
extended opcode shown below. 


256 Service Processor “Attention” (PowerPC 
only) 


Instructions for the POWER Architecture that have 
not been included in the PowerPC Architecture. 
These are listed in Section E.31, “Discontinued 
Opcodes” on page 169 and Section E.32.6, “Dis- 
continued Opcodes’” on page 171. 


Implementation-specific instructions used _ to 
conform to the PowerPC Architecture specification. 


Any other instructions contained in Book IV, 
PowerPC Implementation Features for any imple- 
mentation, that are not defined in the PowerPC 
User Instruction Set Architecture, PowerPC Virtual 
Environment Architecture, or PowerPC Operating 
Environment Architecture. 
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Appendix |. Opcode Maps 


This section contains tables showing the opcodes and 
extended opcodes in all members of the POWER archi- 
tecture family. 


For the primary opcode table (Table 12 on page 181), 
each cell is in the following format. 


Opcode in 
Decimal 


Opcode in 
Hexadecimal 


Instruction 
Mnemonic 


Instruction 
Format 


Applicable 
Machines 





“Applicable Machines” identifies the POWER architec- 
ture family members that recognize the opcode, 
encoded as follows: 


A PowerPC 
P= PowerPC 
2  POWER2 


O Original POWER (RS/6000) 
All All of the above 


The extended opcode tables show the extended opcode 
in decimal, the instruction mnemonic, the applicable 
machines, and the instruction format. These tables 
appear in order of primary opcode within two groups. 
The first group consists of the primary opcodes that 
have small extended opcode fields (2-4 bits), namely 
30, 56, 57, 58, 60, 61, and 62. The second group con- 
sists of primary opcodes that have 10-bit extended 
opcode fields. The tables for the second group are 
rotated. 


In the extended opcode tables several special 
markings are used. 


= A prime (’) following an instruction mnemonic 
denotes an additional cell, after the lowest- 
numbered one, used by the instruction. For 
example, subfc occupies cells 8 and 520 of primary 
opcode 31, with the former corresponding to 
OE=0 and the latter to OE=1. Similarly, sradi 
occupies cells 826 and 827, with the former corre- 


sponding to shs=0 and the latter to shz=1 (the 
9-bit extended opcode 413, shown on page 76, 
excludes the she bit). 


= Two vertical bars (||) are used instead of primed 
mnemonics when an instruction occupies an entire 
column of a table. The instruction mnemonic is 
repeated in the last cell of the column. 


= For primary opcode 31, an asterisk (*) in a cell that 
would otherwise be empty means that the cell is 
reserved because it is “overlaid”, by a fixed-point 
or Storage Access instruction having only a 
primary opcode, by an instruction having an 
extended opcode in primary opcode 30, 58, or 62, 
or by a potential instruction in any of the catego- 
ries just mentioned. The overlaying instruction, if 
any, is also shown. A cell thus reserved should 
not be assigned to an instruction having primary 
opcode 31. (The overlaying is a consequence of 
opcode decoding for fixed-point instructions: the 
primary opcode, and the extended opcode if any, 
are mapped internally to a 10-bit “compressed 
opcode” for ease of subsequent decoding.) 


m Parentheses around the opcode or extended 
opcode mean that the instruction was defined in 
earlier versions of the PowerPC Architecture but is 
no longer defined in the PowerPC Architecture. 


An empty cell, a cell containing only an asterisk, or a 
cell in which the opcode or extended opcode is paren- 
thesized, corresponds to an illegal instruction. 


When instruction names and/or mnemonics differ 
among the family members, the PowerPC/PowerPC ter- 
minology is used. 


The instruction consisting entirely of binary Os causes 
the system illegal instruction error handler to be 
invoked for all members of the POWER family, and this 
is likely to remain true in future models (it is guaran- 
teed in the PowerPC Architecture). An instruction 
having primary opcode 0 but not consisting entirely of 
binary Os is reserved except for the following extended 
opcode (instruction bits 21:30). 


256 Service Processor “Attention” (PowerPC only) 
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Table 12. Primary opcodes 


Reserved 
| 


All D|All 
36 24 | 37 

stw stwu 
All D|All 
40 28) 41 

Ihz Ihzu 
All D|All 
44 2C|45 

sth sthu 
All D|All 
48 30) 49 

lfs lfsu 


Ifq 


3 illegal 
Ds|2 


stfq, 
3 illegal 
DS|2 





D|All D|All D 
25| 38 26|39 27 
stb stbu 
D|All D|All D 
29) 42 2A|43 2B 
Ilha Ihau 
D|All D|All D 
2D | 46 2E| 47 2F 
Imw stmw 
D|All D|All D 
31] 50 32/51 33 
lfd lfdu 


FX DS-form 
Loads 


FP Single 
Extended Ops 
DS|AP A 


3F 
FP Double 
Extended Ops 
All 


62 3E 
FX DS-Form 
Stores 
S|AP DS 





See primary opcode 0 extensions on page 179 


Trap Doubleword Immediate 
Trap Word Immediate 


Multiply Low Immediate 


Subtract From Immediate Carrying 
Difference or Zero Immediate 
Compare Logical Immediate 
Compare Immediate 


Add Immediate Carrying 

Add Immediate Carrying and Record 
Add Immediate 

Add Immediate Shifted 


Branch Conditional 
System Call 

Branch 

See Table 20 on page 184 


Rotate Left Word Imm. then Mask Insert 
Rotate Left Word Imm. then AND with Mask 
Rotate Left then Mask Insert 

Rotate Left Word then AND with Mask 


OR Immediate 

OR Immediate Shifted 
XOR Immediate 

XOR Immediate Shifted 


AND Immediate 

AND Immediate Shifted 
See Table 13 0n page 182 
See Table 21 on page 186 


Load Word and Zero 
Load Word and Zero with Update 
Load Byte and Zero 
Load Byte and Zero with Update 


Store Word 
Store Word with Update 
Store Byte 
Store Byte with Update 


Load Half and Zero 
Load Half and Zero with Update 
Load Half Algebraic 
Load Half Algebraic with Update 


Store Half 

Store Half with Update 
Load Multiple Word 
Store Multiple Word 


Load Floating-Point Single 
Load Floating-Point Single with Update 
Load Floating-Point Double 
Load Floating-Point Double with Update 


Store Floating-Point Single 
Store Floating-Point Single with Update 
Store Floating-Point Double 
Store Floating-Point Double with Update 


See Table 140n page 183 
See Table 15 0n page 183 
See Table 16 on page 183 
See Table 22 on page 188 


See Table 17 on page 183 
See Table 18 0n page 183 
See Table 19 0n page 183 
See Table 23 on page 190 
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Table 13. Extended opcodes for primary opcode 30 
(instruction bits 27:30) 





rldicr rldicr 
AP AP 
MD MD 


rldim’ 
AP 
MD 
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Table 14. Extended opcodes for primary opcode 56 
(instruction bits 30:31) 


Table 15. Extended opcodes for primary opcode 57 


(instruction bits 30:31) 

















Table 16. Extended opcodes for primary opcode 58 
(instruction bits 30:31) 








0 1 0 1 
0 0 
Ifq Ifqu 
0 2 0 2 
DS DS 





Table 17. Extended opcodes for primary opcode 60 
(instruction bits 30:31) 

















0 1 0 1 
0 1 0 
0 Id Idu 0 stfq 
AP AP 2 
DS DS DS 
2 
lwa 
1 AP 1 
DS 























Table 18. Extended opcodes for primary opcode 61 
(instruction bits 30:31) 








(instruction bits 30:31) 


Table 19. Extended opcodes for primary opcode 62 





0 1 0 1 
0 0 1 
0 stfqu 0 std stdu 
2 AP AP 
DS DS DS 
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Appendix J. PowerPC Instruction Set Sorted by Opcode 


This appendix lists all the instructions in the PowerPC 
Architecture, in order by opcode. A page number is 
shown for instructions that are defined in this Book 
(Book I, PowerPC User Instruction Set Architecture), 
and the Book number is shown for instructions that are 


defined in other Books (Book II, PowerPC Virtual Envi- 
ronment Architecture, and Book Ill, PowerPC Oper- 
ating Environment Architecture). If an instruction is 
defined in more than one of these Books, the lowest- 
numbered Book is used. 








For ohcone mode: kage! Mnemonic 
Primary| Extend |Dep.') Bk 

D 2 60 | tdi 

D 3 60 | twi 

D 7 54 | mulli 

D 8 SR 51 | subfic 

D 10 59 | cmpli 

D 11 58 | cmpi 

D 12 SR 50 | addic 

D 13 SR 50 | addic. 

D 14 49 | addi 

D 15 49 | addis 

B 16 CT 23 | be[l][al] 

SC 17 25 | sc 

| 18 23 | b[l][a] 

XL 19 0 28 | mcrf 

XL 19 16 CT 24 | belr[l] 

XL 19 18 IIL] rfid 

XL 19 33 27 | crnor 

XL 19 129 27 | crandc 

XL 19 150 Il} isyne 

XL 19 193 26 | crxor 

XL 19 225 26 | crnand 

XL 19 257 26 | crand 

XL 19 289 27 | creqv 

XL 19 417 27 | crorc 

XL 19 449 26 | cror 

XL 19 528 CT 24 | bectr[l] 

M 20 SR 73° «| rlwimil.] 

M 21 SR 70 | rlwinm[.] 

M 23 SR 72 =| rlwnm[.] 

D 24 63 | ori 

D 25 63 | oris 

D 26 63 | xori 

D 27 63 | xoris 

D 28 SR 62 | andi. 

D 29 SR 62 | andis. 

MD 30 0 SR 69 | ridicl[.] 

MD 30 1 SR 69 | rldicr[.] 

MD 30 2 SR 70 | ridic[.] 

MD 30 3 SR 73 | rldimif.] 

MDS 30 8 SR 71 =| ridel[.] 

MDS 30 9 SR 72 | rlder[.] 








Instruction 


Trap Doubleword Immediate 

Trap Word Immediate 

Multiply Low Immediate 

Subtract From Immediate Carrying 

Compare Logical Immediate 

Compare Immediate 

Add Immediate Carrying 

Add Immediate Carrying and Record 

Add Immediate 

Add Immediate Shifted 

Branch Conditional 

System Call 

Branch 

Move Condition Register Field 

Branch Conditional to Link Register 

Return from Interrupt Doubleword 

Condition Register NOR 

Condition Register AND with Complement 
Instruction Synchronize 

Condition Register XOR 

Condition Register NAND 

Condition Register AND 

Condition Register Equivalent 

Condition Register OR with Complement 
Condition Register OR 

Branch Conditional to Count Register 

Rotate Left Word Immediate then Mask Insert 
Rotate Left Word Immediate then AND with Mask 
Rotate Left Word then AND with Mask 

OR Immediate 

OR Immediate Shifted 

XOR Immediate 

XOR Immediate Shifted 

AND Immediate 

AND Immediate Shifted 

Rotate Left Doubleword Immediate then Clear Left 
Rotate Left Doubleword Immediate then Clear Right 
Rotate Left Doubleword Immediate then Clear 
Rotate Left Doubleword Immediate then Mask Insert 
Rotate Left Doubleword then Clear Left 
Rotate Left Doubleword then Clear Right 
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XO 


XO 
XFX 


x< 
on 
x< 


SEALS PGES INN CIES LS OREN CO PET PATE 


(oxe) 





Opcode 
Primary | Extend 

31 0 
31 4 
31 8 
31 9 
31 10 
31 11 
31 19 
31 19 
31 20 
31 21 
31 23 
31 24 
31 26 
31 27 
31 28 
31 32 
31 40 
31 53 
31 54 
31 55 
31 58 
31 60 
31 68 
31 73 
31 75 
31 83 
31 84 
31 86 
31 87 
31 104 
31 119 
31 124 
31 136 
31 138 
31 144 
31 144 
31 146 
31 149 
31 150 
31 151 
31 178 
31 181 
31 183 
31 200 
31 202 
31 210 
31 214 
31 215 
31 232 
31 233 
31 234 
31 235 
31 242 
31 246 
31 247 
31 266 
31 278 
31 279 





Mode 
Dep.' 


SR 
SR 


SR 


SR 


SR 
SR 


SR 


SR 
SR 


SR 
SR 


SR 
SR 


SR 


SR 


32 


SR 
SR 
SR 
SR 
32 


SR 





Mnemonic 


cmp 
tw 
subfc[o][.] 
mulhdu[.] 
addc[o][.] 
mulhwu[.] 
mfcr 

mfocrf 
lwarx 

Idx 

Iwzx 

slw[.] 
cntlzw[.] 
sld[.] 
and[.] 
cmpl 
subf[o][.] 
Idux 

dcbst 

lwzux 
entlzd[.] 
andc[.] 

td 

mulhd[.] 
mulhw[.] 
mfmsr 
Idarx 

dcbf 

lbzx 
neg[o][.] 
lbzux 
nor[.] 
subfe[o][.] 
adde[o][.] 
mtcrf 
mtocrf 
mtmsr 

stdx 

stwcx. 

stwXx 
mtmsrd 
stdux 
stwux 
subfze[o][.] 
addze[o][.] 
mtsr 

stdex. 

stbx 
subfme[o][.] 
mulld[o][.] 
addme[o][.] 
mullw[o][.] 
mtsrin 
dcbtst 
stbux 
add[o][.] 
dcbt 

Ihzx 


Instruction 


Compare 

Trap Word 

Subtract From Carrying 

Multiply High Doubleword Unsigned 
Add Carrying 

Multiply High Word Unsigned 

Move From Condition Register 

Move From One Condition Register Field 
Load Word And Reserve Indexed 

Load Doubleword Indexed 

Load Word and Zero Indexed 

Shift Left Word 

Count Leading Zeros Word 

Shift Left Doubleword 

AND 

Compare Logical 

Subtract From 

Load Doubleword with Update Indexed 
Data Cache Block Store 

Load Word and Zero with Update Indexed 
Count Leading Zeros Doubleword 

AND with Complement 

Trap Doubleword 

Multiply High Doubleword 

Multiply High Word 

Move From Machine State Register 
Load Doubleword And Reserve Indexed 
Data Cache Block Flush 

Load Byte and Zero Indexed 

Negate 

Load Byte and Zero with Update Indexed 
NOR 

Subtract From Extended 

Add Extended 

Move To Condition Register Fields 
Move To One Condition Register Field 
Move To Machine State Register 

Store Doubleword Indexed 

Store Word Conditional Indexed 

Store Word Indexed 

Move To Machine State Register Doubleword 
Store Doubleword with Update Indexed 
Store Word with Update Indexed 
Subtract From Zero Extended 

Add to Zero Extended 

Move To Segment Register 

Store Doubleword Conditional Indexed 
Store Byte Indexed 

Subtract From Minus One Extended 
Multiply Low Doubleword 

Add to Minus One Extended 

Multiply Low Word 

Move To Segment Register Indirect 
Data Cache Block Touch for Store 
Store Byte with Update Indexed 

Add 

Data Cache Block Touch 

Load Halfword and Zero Indexed 





194 PowerPC User Instruction Set Architecture 





Version 2.01 








a 
x< 


ua 
x< 


OO Oe Pata Po Om) ONE ony MO Pape ee Pm ea Pa 
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Opcode 
Primary | Extend 

31 284 
31 306 
31 310 
31 311 
31 316 
31 339 
31 341 
31 343 
31 370 
31 371 
31 373 
31 375 
31 402 
31 407 
31 412 
31 413 
31 434 
31 438 
31 439 
31 444 
31 457 
31 459 
31 467 
31 476 
31 489 
31 491 
31 498 
31 512 
31 533 
31 534 
31 535 
31 536 
31 539 
31 566 
31 567 
31 595 
31 597 
31 598 
31 599 
31 631 
31 659 
31 661 
31 662 
31 663 
31 695 
31 725 
31 727 
31 759 
31 790 
31 792 
31 794 
31 824 
31 851 
31 854 
31 915 
31 918 
31 922 
31 954 





Mode 
Dep.' 


SR 
64 


SR 


SR 
SR 


SR 
SR 
SR 


SR 


SR 
SR 


32 


32 


SR 


SR 
SR 


SR 
SR 





Mnemonic 


eqv[.] 
tlbie 
eciwx 
Ihzux 
xor[.] 
mfspr 
lwax 
Ihax 
tlbia 
mftb 
Iwaux 
Ihaux 
slbmte 
sthx 
orc[.] 
sradi[.] 
slbie 
ecowx 
sthux 
or[.] 
divdu[o][.] 
divwu[o][.] 
mtspr 
nand[.] 
divd[o][.] 
divw[o][.] 
slbia 
merxr 
Iswx 
Iwbrx 
lfsx 
srw[.] 
srd[.] 
tlbsync 
lfsux 
mfsr 
Iswi 
sync 
lfdx 
Ifdux 
mfsrin 
stswx 
stwbrx 
stfsx 
stfsux 
stswi 
stfdx 
stfdux 
Ihbrx 
sraw[.] 
srad[.] 
srawi[.] 
slomfev 
eieio 
slomfee 
sthbrx 
extsh[.] 
extsb[.] 


Instruction 


Equivalent 

TLB Invalidate Entry 

External Control In Word Indexed 

Load Halfword and Zero with Update Indexed 
XOR 

Move From Special Purpose Register 

Load Word Algebraic Indexed 

Load Halfword Algebraic Indexed 

TLB Invalidate All 

Move From Time Base 

Load Word Algebraic with Update Indexed 
Load Halfword Algebraic with Update Indexed 
SLB Move To Entry 

Store Halfword Indexed 

OR with Complement 

Shift Right Algebraic Doubleword Immediate 
SLB Invalidate Entry 

External Control Out Word Indexed 

Store Halfword with Update Indexed 

OR 

Divide Doubleword Unsigned 

Divide Word Unsigned 

Move To Special Purpose Register 

NAND 

Divide Doubleword 

Divide Word 

SLB Invalidate All 

Move to Condition Register from XER 

Load String Word Indexed 

Load Word Byte-Reverse Indexed 

Load Floating-Point Single Indexed 

Shift Right Word 

Shift Right Doubleword 

TLB Synchronize 

Load Floating-Point Single with Update Indexed 
Move From Segment Register 

Load String Word Immediate 

Synchronize 

Load Floating-Point Double Indexed 

Load Floating-Point Double with Update Indexed 
Move From Segment Register Indirect 

Store String Word Indexed 

Store Word Byte-Reverse Indexed 

Store Floating-Point Single Indexed 

Store Floating-Point Single with Update Indexed 
Store String Word Immediate 

Store Floating-Point Double Indexed 

Store Floating-Point Double with Update Indexed 
Load Halfword Byte-Reverse Indexed 

Shift Right Algebraic Word 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Word Immediate 

SLB Move From Entry VSID 

Enforce In-order Execution of I/O 

SLB Move From Entry ESID 

Store Halfword Byte-Reverse Indexed 

Extend Sign Halfword 

Extend Sign Byte 
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Opcode Mode 
rm ‘ 
Primary| Extend | Dep. 
X 31 982 
X 31 983 
X 31 986 SR 
X 31 1014 
D 32 
D 33 
D 34 
D 35 
D 36 
D 37 
D 38 
D 39 
D 40 
D 41 
D 42 
D 43 
D 44 
D 45 
D 46 
D 47 
D 48 
D 49 
D 50 
D 51 
D 52 
D 53 
D 54 
D 55 
DS 58 0 
DS 58 1 
DS 58 2 
A 59 18 
A 59 20 
A 59 21 
A 59 22 
A 59 24 
A 59 25 
A 59 28 
A 59 29 
A 59 30 
A 59 31 
DS 62 0 
DS 62 1 
X 63 0 
X 63 12 
X 63 14 
X 63 15 
A 63 18 
A 63 20 
A 63 21 
A 63 22 
A 63 23 
A 63 25 
A 63 26 
A 63 28 
A 63 29 
A 63 30 
A 63 31 








Mnemonic 


icbi 
stfiwx 
extsw[.] 
dcbz 

lwz 

Iwzu 

lbz 

Ibzu 

stw 

stwu 

stb 

stbu 

Ihz 

Ihzu 

lha 

lhau 

sth 

sthu 

Imw 
stmw 

lfs 

lfsu 

lfd 

lfdu 

stfs 

stfsu 

stfd 

stfdu 

Id 

Idu 

lwa 
fdivs[.] 
fsubs[.] 
fadds[.] 
fsqrts[.] 
fres[.] 
fmuls[.] 
fmsubs[.] 
fmadds[.] 
fnmsubs[.] 
fnmadds[.] 
std 

stdu 
fempu 
frsp[.] 
fctiw[.] 
fctiwz[.] 
fdiv[.] 
fsub[.] 
fadd[.] 
fsqrt[.] 
fsel[.] 
fmul[.] 
frsqrte[.] 
fmsub[.] 
fmadd[.] 
fnmsub[.] 
fnmadd[.] 


Instruction 


Instruction Cache Block Invalidate 
Store Floating-Point as Integer Word Indexed 
Extend Sign Word 

Data Cache Block set to Zero 

Load Word and Zero 

Load Word and Zero with Update 

Load Byte and Zero 

Load Byte and Zero with Update 

Store Word 

Store Word with Update 

Store Byte 

Store Byte with Update 

Load Halfword and Zero 

Load Halfword and Zero with Update 
Load Halfword Algebraic 

Load Halfword Algebraic with Update 
Store Halfword 

Store Halfword with Update 

Load Multiple Word 

Store Multiple Word 

Load Floating-Point Single 

Load Floating-Point Single with Update 
Load Floating-Point Double 

Load Floating-Point Double with Update 
Store Floating-Point Single 

Store Floating-Point Single with Update 
Store Floating-Point Double 

Store Floating-Point Double with Update 
Load Doubleword 

Load Doubleword with Update 

Load Word Algebraic 

Floating Divide Single 

Floating Subtract Single 

Floating Add Single 

Floating Square Root Single 

Floating Reciprocal Estimate Single 
Floating Multiply Single 

Floating Multiply-Subtract Single 
Floating Multiply-Add Single 

Floating Negative Multiply-Subtract Single 
Floating Negative Multiply-Add Single 
Store Doubleword 

Store Doubleword with Update 

Floating Compare Unordered 

Floating Round to Single-Precision 
Floating Convert To Integer Word 
Floating Convert To Integer Word with round toward Zero 
Floating Divide 

Floating Subtract 

Floating Add 

Floating Square Root 

Floating Select 

Floating Multiply 

Floating Reciprocal Square Root Estimate 
Floating Multiply-Subtract 

Floating Multiply-Add 

Floating Negative Multiply-Subtract 
Floating Negative Multiply-Add 
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Form ebeote made rege! Mnemonic 
Primary| Extend |Dep.'| Bk 

X 63 32 113 | fempo 
X 63 38 116 | mtfsb1[.] 
X 63 40 104 | fneg[.] 
X 63 64 114 | mcrfs 

X 63 70 116 | mtfsbO[.] 
X 63 72 104 | fmr[.] 

X 63 134 115 | mtfsfi[.] 
X 63 136 104 | fnabs[.] 
X 63 264 104 | fabs[.] 
X 63 583 114 | mffs[.] 
XFL 63 711 115 | mtfsf[.] 
X 63 814 110 | fetid[.] 
X 63 815 110 | fetidz[.] 
X 63 846 112 | fcfid[.] 








Instruction 


Zero 


Floating Compare Ordered 

Move To FPSCR Bit 1 

Floating Negate 

Move to Condition Register from FPSCR 
Move To FPSCR Bit 0 

Floating Move Register 

Move To FPSCR Field Immediate 
Floating Negative Absolute Value 
Floating Absolute Value 

Move From FPSCR 

Move To FPSCR Fields 

Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword with round toward 


Floating Convert From Integer Doubleword 








1See key to mode dependency column, on page 203. 
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Appendix K. PowerPC Instruction Set Sorted by Mnemonic 


This appendix lists all the instructions in the PowerPC 
Architecture, in order by mnemonic. A page number 
is shown for instructions that are defined in this Book 
(Book |, PowerPC User Instruction Set Architecture), 
and the Book number is shown for instructions that 


are defined in other Books (Book Il, PowerPC Virtual 
Environment Architecture, and Book Ill, PowerPC 
Operating Environment Architecture). \f an instruction 
is defined in more than one of these Books, the 
lowest-numbered Book is used. 








Opcode Mode | Page / |i onic 
Primary| Extend Dep.') Bk 

XO 31 266 SR 50 | add[o][.] 
XO 31 10 SR 51 | addc[o][.] 
XO 31 138 SR 52 | adde[o][.] 
D 14 49 | addi 
D 12 SR 50 | addic 
D 13 SR 50 | addic. 
D 15 49 | addis 
XO 31 234 SR 52 | addme[o][.] 
XO 31 202 SR 53 | addze[o][.] 
X 31 28 SR 64 | and[.] 
X 31 60 SR 65 | andc[.] 
D 28 SR 62 | andi. 
D 29 SR 62 | andis. 
| 18 23 | b[l][a] 
B 16 CT 23 | be[l][a] 
XL 19 528 CT 24 | bectr[l] 
XL 19 16 CT 24 | belr[l] 
X 31 0 58 | cmp 
D 11 58 | cmpi 
X 31 32 59 | cmpl 
D 10 59 | cmpli 
X 31 58 SR 67 | cntlzd[.] 
Xx 31 26 SR 67 | cntlzw[.] 
XL 19 257 26 | crand 
XL 19 129 27 | crandc 
XL 19 289 27 | creqv 
XL 19 225 26 | crnand 
XL 19 33 27 | crnor 
XL 19 449 26 | cror 
XL 19 417 27 | crore 
XL 19 193 26 | crxor 
X 31 86 Il} dcebf 
xX 31 54 Il} dcbst 
X 31 278 Il} debt 
X 31 246 Il} debtst 
X 31 1014 Il} dcbz 
XO 31 489 SR 56 | divd[o][.] 
XO 31 457 SR 57 | divdu[o][.] 
XO 31 491 SR 56 | divw[o][.] 
XO 31 459 SR 57 | divwu[o][.] 
xX 31 310 Il] eciwx 








Instruction 


Add 

Add Carrying 

Add Extended 

Add Immediate 

Add Immediate Carrying 

Add Immediate Carrying and Record 
Add Immediate Shifted 

Add to Minus One Extended 

Add to Zero Extended 

AND 

AND with Complement 

AND Immediate 

AND Immediate Shifted 

Branch 

Branch Conditional 

Branch Conditional to Count Register 
Branch Conditional to Link Register 
Compare 

Compare Immediate 

Compare Logical 

Compare Logical Immediate 

Count Leading Zeros Doubleword 
Count Leading Zeros Word 
Condition Register AND 

Condition Register AND with Complement 
Condition Register Equivalent 
Condition Register NAND 

Condition Register NOR 

Condition Register OR 

Condition Register OR with Complement 
Condition Register XOR 

Data Cache Block Flush 

Data Cache Block Store 

Data Cache Block Touch 

Data Cache Block Touch for Store 
Data Cache Block set to Zero 
Divide Doubleword 

Divide Doubleword Unsigned 
Divide Word 

Divide Word Unsigned 

External Control In Word Indexed 
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Opcode 
rm 
Primary | Extend 

X 31 438 
X 31 854 
X 31 284 
X 31 954 
X 31 922 
X 31 986 
X 63 264 
A 63 21 
A 59 21 
X 63 846 
X 63 32 
X 63 0 
X 63 814 
X 63 815 
X 63 14 
X 63 15 
A 63 18 
A 59 18 
A 63 29 
A 59 29 
X 63 72 
A 63 28 
A 59 28 
A 63 25 
A 59 25 
X 63 136 
X 63 40 
A 63 31 
A 59 31 
A 63 30 
A 59 30 
A 59 24 
X 63 12 
A 63 26 
A 63 23 
A 63 22 
A 59 22 
A 63 20 
A 59 20 
X 31 982 
XL 19 150 
D 34 

D 35 

X 31 119 
X 31 87 
DS 58 0 
X 31 84 
DS 58 1 
X 31 53 
X 31 21 
D 50 

D 51 

X 31 631 
X 31 599 
D 48 

D 49 

X 31 567 





Mode 
Dep.' 


SR 
SR 


SR 





Page / 
Bk 


Mnemonic 


ecOowx 
eieio 
eqv[.] 
extsb[.] 
extsh[.] 
extsw[.] 
fabs[.] 
fadd[.] 
fadds[.] 
fcfid[ .] 
fempo 
fempu 
fctid[.] 
fctidz[.] 


fctiw[.] 
fctiwz[.] 
fdiv[.] 
fdivs[.] 
fmadd[.] 
fmadds[.] 
fmr[.] 
fmsub[.] 
fmsubs[.] 
fmul[.] 
fmuls[.] 
fnabs[.] 
fneg[.] 
fnmadd[.] 
fnmadds[.] 
fnmsub[.] 
fnmsubs[.] 
fres[.] 
frsp[.] 
frsqrte[.] 
fsel[.] 
fsqrt[.] 
fsqrts[.] 
fsub[.] 
fsubs[.] 
icbi 

isync 

lbz 

Ibzu 

lbzux 
lbzx 

Id 

Idarx 

Idu 

Idux 

Idx 

lfd 

lfdu 

Ifdux 

Ifdx 

lfs 

lfsu 

lfsux 


Instruction 


External Control Out Word Indexed 
Enforce In-order Execution of I/O 
Equivalent 

Extend Sign Byte 

Extend Sign Halfword 

Extend Sign Word 

Floating Absolute Value 

Floating Add 

Floating Add Single 

Floating Convert From Integer Doubleword 
Floating Compare Ordered 

Floating Compare Unordered 

Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword with round 
toward Zero 

Floating Convert To Integer Word 

Floating Convert To Integer Word with round toward Zero 
Floating Divide 

Floating Divide Single 

Floating Multiply-Add 

Floating Multiply-Add Single 

Floating Move Register 

Floating Multiply-Subtract 

Floating Multiply-Subtract Single 

Floating Multiply 

Floating Multiply Single 

Floating Negative Absolute Value 

Floating Negate 

Floating Negative Multiply-Add 

Floating Negative Multiply-Add Single 
Floating Negative Multiply-Subtract 
Floating Negative Multiply-Subtract Single 
Floating Reciprocal Estimate Single 
Floating Round to Single-Precision 
Floating Reciprocal Square Root Estimate 
Floating Select 

Floating Square Root 

Floating Square Root Single 

Floating Subtract 

Floating Subtract Single 

Instruction Cache Block Invalidate 
Instruction Synchronize 

Load Byte and Zero 

Load Byte and Zero with Update 

Load Byte and Zero with Update Indexed 
Load Byte and Zero Indexed 

Load Doubleword 

Load Doubleword And Reserve Indexed 
Load Doubleword with Update 

Load Doubleword with Update Indexed 
Load Doubleword Indexed 

Load Floating-Point Double 

Load Floating-Point Double with Update 
Load Floating-Point Double with Update Indexed 
Load Floating-Point Double Indexed 

Load Floating-Point Single 

Load Floating-Point Single with Update 
Load Floating-Point Single with Update Indexed 





200 PowerPC User Instruction Set Architecture 





Version 2.01 











Opcode Mode | Page / ; ; 
{ Mnemonic Instruction 
Primary| Extend |Dep.'| Bk 
X 31 535 98 | lfsx Load Floating-Point Single Indexed 
D 42 34 | Iha Load Halfword Algebraic 
D 43 34 | lhau Load Halfword Algebraic with Update 
X 31 375 34 | Ihaux Load Halfword Algebraic with Update Indexed 
X 31 343 34 | Ihax Load Halfword Algebraic Indexed 
X 31 790 42 | lhbrx Load Halfword Byte-Reverse Indexed 
D 40 33 | Ihz Load Halfword and Zero 
D 41 33 | Ihzu Load Halfword and Zero with Update 
X 31 311 33 | Ihzux Load Halfword and Zero with Update Indexed 
X 31 279 33 | Ihzx Load Halfword and Zero Indexed 
D 46 44 | Imw Load Multiple Word 
X 31 597 46 | Iswi Load String Word Immediate 
X 31 533 46 | lswx Load String Word Indexed 
DS 58 2 36 | lwa Load Word Algebraic 
X 31 20 Il} lwarx Load Word And Reserve Indexed 
X 31 373 36 | lwaux Load Word Algebraic with Update Indexed 
X 31 341 36 | lwax Load Word Algebraic Indexed 
X 31 534 42 | lwbrx Load Word Byte-Reverse Indexed 
D 32 35 | lwz Load Word and Zero 
D 33 35 | lwzu Load Word and Zero with Update 
X 31 55 35 | lwzux Load Word and Zero with Update Indexed 
X 31 23 35 | lwzx Load Word and Zero Indexed 
XL 19 0 28 | merf Move Condition Register Field 
X 63 64 114 | merfs Move to Condition Register from FPSCR 
X 31 512 131 | merxr Move to Condition Register from XER 
XFX 31 19 80 | mfcr Move From Condition Register 
XFX 31 19 118 | mfocrf Move From One Condition Register Field 
xX 63 583 114 | mffs[.] Move From FPSCR 
X 31 83 Ill] mfmsr Move From Machine State Register 
XFX 31 339 79 | mfspr Move From Special Purpose Register 
X 31 595 32 Ill} mfsr Move From Segment Register 
X 31 659 32 Ill} mfsrin Move From Segment Register Indirect 
XFX 31 371 Il} mftb Move From Time Base 
XFX 31 144 80 | mtcrf Move To Condition Register Fields 
XFX 31 144 118 | mtocrf Move To One Condition Register Field 
X 63 70 116 | mtfsbO[.] Move To FPSCR Bit 0 
Xx 63 38 116 | mtfsb1[.] Move To FPSCR Bit 1 
XFL 63 711 115 | mtfsf[.] Move To FPSCR Fields 
xX 63 134 115 | mtfsfi[.] Move To FPSCR Field Immediate 
X 31 146 Ill} mtmsr Move To Machine State Register 
X 31 178 Ill} mtmsrd Move To Machine State Register Doubleword 
XFX 31 467 78 | mtspr Move To Special Purpose Register 
X 31 210 32 Ill} mtsr Move To Segment Register 
X 31 242 32 Ill} mtsrin Move To Segment Register Indirect 
XO 31 73 SR 55 | mulhd[.] Multiply High Doubleword 
XO 31 9 SR 55 | mulhdu[.] Multiply High Doubleword Unsigned 
XO 31 75 SR 55 | mulhw[.] Multiply High Word 
XO 31 11 SR 55 | mulhwu[.] Multiply High Word Unsigned 
XO 31 233 SR 54 | mulld[o][.] Multiply Low Doubleword 
D 7 54 | mulli Multiply Low Immediate 
XO 31 235 SR 54 | mullw[o][.] Multiply Low Word 
X 31 476 SR 64 | nand[.] NAND 
XO 31 104 SR 53 | neg[o][.] Negate 
X 31 124 SR 65 | nor[.] NOR 
X 31 444 SR 64 | or[.] OR 
X 31 412 SR 65 | orc[.] OR with Complement 
D 24 63 | ori OR Immediate 
D 25 63 | oris OR Immediate Shifted 
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Opcode 
orm 
Primary | Extend 

XL 19 18 
MDS 30 8 
MDS 30 9 
MD 30 2 
MD 30 0 
MD 30 1 
MD 30 3 
M 20 

M 21 

M 23 

SC 17 

X 31 498 
X 31 434 
X 31 915 
X 31 851 
X 31 402 
X 31 27 
X 31 24 
X 31 794 
XS 31 413 
X 31 792 
X 31 824 
X 31 539 
X 31 536 
D 38 

D 39 

X 31 247 
X 31 215 
DS 62 0 
X 31 214 
DS 62 1 
X 31 181 
X 31 149 
D 54 

D 55 

X 31 759 
X 31 727 
X 31 983 
D 52 

D 53 

X 31 695 
X 31 663 
D 44 

X 31 918 
D 45 

X 31 439 
X 31 407 
D 47 

X 31 725 
X 31 661 
D 36 

X 31 662 
X 31 150 
D 37 

X 31 183 
X 31 151 
XO 31 40 
XO 31 8 





Mode 
Dep.' 


SR 
SR 
SR 


SR 
SR 
SR 


SR 


SR 
SR 
SR 
SR 


SR 
SR 
SR 


SR 
SR 





Page / 
Bk 


Mnemonic 


rfid 
ridcl{.] 
rilder[.] 
ridic[.] 
ridicl[.] 
ridicr[.] 
ridimi[.] 
rlwimi[.] 
rlwinm[.] 
rlwnm[.] 
sc 

slbia 
slbie 
slomfee 
slomfev 
slobmte 
sld[.] 
slw[.] 
srad[.] 
sradi[.] 
sraw[.] 
srawi[.] 
srd[.] 
srw[.] 
stb 

stbu 
stbux 
stbx 

std 
stdex. 
stdu 
stdux 
stdx 
stfd 
stfdu 
stfdux 
stfdx 
stfiwx 
stfs 
stfsu 
stfsux 
stfisx 
sth 
sthbrx 
sthu 
sthux 
sthx 
stmw 
stswi 
stswXx 
stw 
stwbrx 
stwcx. 
stwu 
stwux 
stwx 
subf[o][.] 
subfc[o][.] 


Instruction 


Return from Interrupt Doubleword 

Rotate Left Doubleword then Clear Left 

Rotate Left Doubleword then Clear Right 
Rotate Left Doubleword Immediate then Clear 
Rotate Left Doubleword Immediate then Clear Left 
Rotate Left Doubleword Immediate then Clear Right 
Rotate Left Doubleword Immediate then Mask Insert 
Rotate Left Word Immediate then Mask Insert 
Rotate Left Word Immediate then AND with Mask 
Rotate Left Word then AND with Mask 

System Call 

SLB Invalidate All 

SLB Invalidate Entry 

SLB Move From Entry ESID 

SLB Move From Entry VSID 

SLB Move To Entry 

Shift Left Doubleword 

Shift Left Word 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Doubleword Immediate 
Shift Right Algebraic Word 

Shift Right Algebraic Word Immediate 

Shift Right Doubleword 

Shift Right Word 

Store Byte 

Store Byte with Update 

Store Byte with Update Indexed 

Store Byte Indexed 

Store Doubleword 

Store Doubleword Conditional Indexed 

Store Doubleword with Update 

Store Doubleword with Update Indexed 

Store Doubleword Indexed 

Store Floating-Point Double 

Store Floating-Point Double with Update 

Store Floating-Point Double with Update Indexed 
Store Floating-Point Double Indexed 

Store Floating-Point as Integer Word Indexed 
Store Floating-Point Single 

Store Floating-Point Single with Update 

Store Floating-Point Single with Update Indexed 
Store Floating-Point Single Indexed 

Store Halfword 

Store Halfword Byte-Reverse Indexed 

Store Halfword with Update 

Store Halfword with Update Indexed 

Store Halfword Indexed 

Store Multiple Word 

Store String Word Immediate 

Store String Word Indexed 

Store Word 

Store Word Byte-Reverse Indexed 

Store Word Conditional Indexed 

Store Word with Update 

Store Word with Update Indexed 

Store Word Indexed 

Subtract From 

Subtract From Carrying 
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Form oheone mode Rade 7 Mnemonic 
Primary| Extend |Dep.') Bk 

XO 31 136 SR 52 | subfe[o][.] 

D 8 SR 51 | subfic 

XO 31 232 SR 52 | subfme[o][.] 

XO 31 200 SR 53 | subfze[o][.] 

X 31 598 Il} sync 

X 31 68 61 | td 

D 2 60 | tdi 

X 31 370 Ill} tlbia 

X 31 306 64 Ill} tlbie 

X 31 566 Ill] tlosync 

X 31 4 61 | tw 

D 3 60 | twi 

X 31 316 SR 64 | xor[.] 

D 26 63 | xori 

D 27 63 | xoris 








Instruction 


Subtract From Extended 

Subtract From Immediate Carrying 
Subtract From Minus One Extended 
Subtract From Zero Extended 
Synchronize 

Trap Doubleword 

Trap Doubleword Immediate 

TLB Invalidate All 

TLB Invalidate Entry 

TLB Synchronize 

Trap Word 

Trap Word Immediate 

XOR 

XOR Immediate 

XOR Immediate Shifted 








1Key to Mode Dependency Column 


Except as described below and in Section 1.12.2, 
“Effective Address Calculation” on page 14, all 
instructions are independent of whether the processor 
is in 32-bit or 64-bit mode. 


CT If the instruction tests the Count Register, it 
tests the low-order 32 bits in 32-bit mode and 
all 64 bits in 64-bit mode. 

SR The setting of status registers (such as XER 


and CRO) is mode-dependent. 


32 The instruction must be executed only in 
32-bit mode. 
64 The instruction must be executed only in 


64-bit mode. 
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Index 


[A] 


abit 20 

A-form 10 

AA field 10 

address 14 
effective 14 

assembler language 
extended mnemonics 143 
mnemonics 143 
symbols 143 


B-form 8 

BA field 10 

BB field 10 

BD field 10 

BF field 10 

BFA field 10 

BH field 10 

BI field 10 
Big-Endian 122 
BO field 10, 20 
boundedly undefined 3 
BT field 10 

byte ordering 122 
bytes 2 


C 84 
CA 30 
CIA 4 
CR 18 
CTR 19 


[| 


D field 11 

D-form 8 

defined instructions 12 
denormalization 87 


denormalized number 86 
double-precision 88 
doublewords 2 

DS field 11 

DS-form 8 


| 


EA 14 
effective address 14 
EQ 18, 19 


F | 


facilities 
optional 13 
FE 19, 84 
FEX 83 
FG 19, 84 
Fl 84 
FL 18, 84 
FLM field 11 
floating-point 
denormalization 87 
double-precision 88 
exceptions 82, 89 
inexact 94 
invalid operation 91 
overflow 93 
underflow 93 
zero divide 92 
execution models 94 
normalization 87 


number 
denormalized 86 
infinity 86 


normalized 86 
not anumber 86 
zero 86 
rounding 89 
sign 87 
single-precision 88 
FPCC 84 
FPR 82 
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FPRF 84 
FPSCR 83 
C 84 

FE 84 
FEX 83 
FG 84 
Fl 84 


VXCVI 84 
VXIDI 83 
VXIMZ 84 
VXISI 83 
VXSNAN 83 
VXSOFT 84 
VXSQRT 84 
VXVC 84 
VXZDZ 84 
XE 84 


FR 84 

FRA field 11 
FRB field 11 
FRC field 11 
FRS field 11 
FRT field 11 
FU 19, 84 
FX 83 

FXM field 11 


[e 


GPR 29 
GT 18, 19 
Gulliver's Travels 122 


[LH 


halfwords 2 


hardware description language 4 


[| 


l-form 7 

illegal instructions 12 

inexact 94 

infinity 86 

instruction 

fields 10, 11, 12 

AA 10 
BA 10 
BB 10 


Ul 12 
XO 12 
formats 7, 8,9, 10 
A-form 10 
B-form 8 
D-form 8 
DS-form 8 
l-form 7 
M-form 10 
MD-form 10 
MDS-form 10 
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instruction (continued) 
formats (continued) 
SC-form 8 
X-form 9 
XFL-form 9 
XFX-form 9 
XL-form 9 
XO-form 9 
XS-form 9 
instructions 
classes 12 
defined 12 
forms 13 
illegal 12 
invalid forms 13 
optional 13 
preferred forms 13 
reserved 12 
invalid instruction forms 13 
invalid operation 91 


L field 11 
language used for instruction operation description 4 
LEV field 11 


LI field 11 
Little-Endian 122 
LK field 11 
LR 19 

LT 18 
M-form 10 
MB field 11 
MD-form 10 
MDS-form 10 
ME field 11 
mnemonics 


extended 143 


[LN 


NB field 11 

NI 84 

NIA 4 

no-op 63 
normalization 87 
normalized number 86 
not anumber 86 


[0 


OE 84 

OE field 11 

optional facilities 131 
optional facility 13 
optional instruction 13 
OV 30 

overflow 93 

OX 83 


[P| 


preferred instruction forms 13 


[a 


quadwords 2 


[F 


RA field 11 

RB field 11 

Rec field 11 

register transfer level language 4 

registers 
Condition Register 18 
Count Register 19 
Fixed-Point Exception Register 30 
Floating-Point Registers 82 
Floating-Point Status and Control Register 83 
General Purpose Registers 29 
Link Register 19 

reserved field 3 

reserved instructions 12 


RN 85 

rounding 89 

RS field 11 

RT field 11 

RTL 4 

SC-form 8 
sequential execution model 17 
SH field 11 

Sl field 11 

sign 87 
single-precision 88 
SO 18, 19, 30 

split field notation 7 
SPR field 11 

SR field 12 


storage access 
floating-point 97 
storage address 14 
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Swift, Jonathan 122 
symbols 143 


t bit 20 

TBR field 12 
TH field 12 
TO field 12 


U field 12 


Ul field 12 
undefined 

boundedly 3 
underflow 93 
UX 83 


VE 84 

VX 83 
VXCVI 84 
VXIDI 83 
VXIMZ 84 
VXISI 83 
VXSNAN 83 
VXSOFT 84 
VXSQRT 84 
VXVC 84 
VXZDZ 84 


words 2 


X-form 9 
XE 84 
XER_ 30 
XFL-form 9 
XFX-form 9 
XL-form 9 
XO field 12 
XO-form 
XS-form 9 
XX 83 


2 


zbit 20 

ZE 84 

zero 86 

zero divide 92 
ZX 83 
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